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Preface 

This databook describes the architecture of the transputer family of products and details some of the devices 
which make up that family. Items described include the 32 bit and 16 bit transputer products IMS T805, 
IMS T801, IMS T800, IMS T425, IMS T414, IMS T222 and IMS T225; the peripheral controller IMS M212; 
and the communications devices IMS C004, IMS C011 and IMS C012. For details of the military version of 
a device refer to The Military Micro-products Databook which is available as a separate publication. 

The databook first describes the transputer architecture and general features of transputer family devices. It 
then continues with the various product datasheets. 

A transputer is a single VLSI device with processor, memory and communications links for direct connection 
to other transputers. Concurrent systems can be constructed from a collection of transputers operating 
concurrently and communicating through links. The transputer can be used as a building block for concurrent 
processing systems, with Occam as the associated design formalism. 

Current transputer products include the 1 6 bit IMS T222, the 32 bit IMS T41 4 and IMS T425, and the IMS T800, 
IMS T801 and IMS T805 which are 32 bit transputers with an integral high speed floating point processor. A 
product preview of the IMS T225, which is a 16 bit transputer with debugger support, is also included. 

The IMS M212 is an intelligent peripheral controller. It contains a 16 bit processor, on-chip memory and 
communications links. It contains hardware and interface logic to control disk drives and can be used as a 
programmable disk controller or as a general purpose peripheral interface. 

The INMOS serial communication link is a high speed system interconnect which provides full duplex com- 
munication between members of the transputer family. It can also be used as a general purpose interconnect 
even where transputers are not used. The IMS C011 and IMS C012 link adaptors are communications de- 
vices enabling the INMOS serial communication link to be connected to parallel data ports and microprocessor 
buses. Ths IMS C004 is a programmable link switch. It provides a full crossbar switch between 32 link inputs 
and 32 link outputs. 

The transputer development system referred to in this databook comprises an integrated editor, compiler 
and debugging system which enables transputers to be programmed in Occam and in industry standard 
languages, for example, C, Fortran, Pascal. The Transputer Development System Manual is supplied with 
the transputer development system and is available as a separate publication. 

Other information relevant to all transputer products is contained in the OCCam Reference Manual, supplied 
with INMOS software products and available as a separate publication. If more detail on the machine level 
operation is required, refer to Transputer Instruction Set - A Compiler Writers' Guide, which is available as a 
separate publication. 

Various application and technical notes are also available from INMOS. 

Software and hardware examples given in this databook are outline design studies and are included to 
illustrate various ways in which transputers can be used. The examples are not intended to provide accurate 
application designs. 

In addition to transputer devices, the INMOS product range also includes graphics products, digital signal 
processing devices and memory devices. For further information concerning INMOS products, please contact 
your local INMOS sales outlet. 



Notation and nomenclature 

The nomenclature and notation in general use throughout this databook is described below. 

Significance 

The bits in a byte are numbered to 7, with bit least significant. The bytes in words are numbered from 0, 
with byte least significant. In general, wherever a value is treated as a number of component values, the 
components are numbered in order of increasing numerical significance, with the least significant component 
numbered 0. Where values are stored in memory, the least significant component value is stored at the 
lowest (most negative) address. 

Similarly, components of arrays are numbered starting from and stored in memory with component at the 
lowest address. 

Transputer memory is byte addressed, with words aligned on four-byte boundaries for 32 bit devices and on 
two-byte boundaries for 1 6 bit devices. 

Hexadecimal values are prefixed with #, as in ^1DF. 

Where a byte is transmitted serially, it is always transmitted least significant bit (0) first. In general, wherever 
a value is transmitted as a number of component values, the least significant component is transmitted first. 
Where an array is transmitted serially, component is transmitted first. Consequently, block transfers to and 
from memory are performed starting with the lowest (most negative) address and ending with the highest 
(most positive) one. 

In diagrams, the least significant component of a value is to the right hand side of the diagram. Component 
of an array is at the bottom of a diagram, as are the most negative memory locations. 

Signal naming conventions 

Signal names identifying individual pins of a transputer chip have been chosen to avoid being cryptic, giving 
as much information as possible. The majority of transputer signals are active high. Those which are active 
low have names commencing with not; names such as RnotW imply that the first component of the name 
refers to its active high state and the second to its active low state. Capitals are used to introduce new 
components of a name, as in ProcClockOut. 

All transputer signals described in the text of this databook are printed in bold. Registers and flags internal to 
a device are printed in italics, as are instruction operation codes. Italics are also used for emphasis. Occam 
program notation is printed in a fixed space teletype style. 

References 

The databook is divided into several chapters, each chapter having a number of sections and subsections. 
Figures and tables have reference numbers tied to relevant sections of a particular chapter of the databook. 
Unless othenA/ise stated, all references refer to those within the current chapter of the databook. 



XXI 



Transputer product numbers 

All INMOS products, both memories and transputers, have a part number of the general form 

IMS abbbc-xyyz 

Field a identifies the product group. This is a digit for memory products and a letter for other devices, the 
particular letter indicating the type of product (table 1). Field bbb identifies the product within that group and 
field c is its revision code. Field x denotes the package type, whilst field yy indicates speed variants etc. 
The final field z indicates to which specification the component is qualified; standard, military etc. Where 
appropriate some identifiers may be omitted, depending on the device. 

A typical product part would be IMS T800C-G20S. 



Table 1 INMOS products 



IMS1... 


Static RAM products 


IMS A... 


Digital signal processors 


IMSB... 


PC boards and modular hardware 


IMSC... 


Communications adaptors 


IMS D... 


Development system 


IMSG... 


Graphics products 


IMS L... 


Literature 


IMSM... 


Peripheral control transputers 


IMS P... 


Occam programming system 


IMSS... 


Software product 


IMST... 
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• INMOS 



1 Introduction 

INMOS is a recognised leader in the development and design of high-performance integrated circuits and is 
a pioneer in the field of parallel processing. The company manufactures components designed to satisfy the 
most demanding of current processing applications and also provide an upgrade path for future applications. 
Current designs and development will meet the requirements of systems in the next decade. Computing 
requirements essentially include high-performance, flexibility and simplicity of use. These characteristics are 
central to the design of all INMOS products. 

INMOS has a consistent record of innovation over a wide product range and supplies components to system 
manufacturing companies in the United States, Europe, Japan and the Far East. As developers of the 
Transputer, a unique microprocessor concept with a revolutionary architecture, and the Occam parallel 
processing language, INMOS has established the standards for the future exploitation of the power of parallel 
processing. INMOS products include a range of transputer products in addition to a highly successful range 
of high-performance graphics devices, an Innovative and successful range of high-performance digital signal 
processing (DSP) devices and a broad range of fast static RAMs, an area in which it has achieved a greater 
than 10% market share. 

The corporate headquarters, product design team and worldwide sales and marketing management are based 
at Bristol, UK. 

INMOS is constantly upgrading, improving and developing its product range and is committed to maintaining 
a global position of innovation and leadership. 

1.1 Manufacturing 

INMOS products are manufactured at the INMOS Newport, Duffryn facility which began operations in 1983. 
This is an 8000 square metre building with a 3000 square metre cleanroom operating to Class 10 environment 
in the work areas. 

To produce high performance products, where each microchip may consist of up to 400,000 transistors, 
INMOS uses advanced manufacturing equipment. Wafer steppers, plasma etchers and ion implanters form 
the basis of fabrication. 

1.2 Assembly 

Sub-contractors in Korea, Taiwan, Hong Kong and the UK are used to assemble devices. 

1.3 Test 

The final testing of commercial products is carried out at the INMOS Newport, Coed Rhedyn facility. Military 
final testing takes place at Colorado Springs. 

1.4 Quality and Reliability 

Stringent controls of quality and reliability provide the customer with early failure rates of less than 1000 
ppm and long term reliability rates of better than 100 FITs (one FIT is one failure per 1000 million hours). 
Requirements for military products are even more stringent. 

1.5 Military 

Various INMOS products are already available in military versions processed in full compliance with MIL-STD- 
883C. Further military programmes are currently in progress. 
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1.6 Future Developments 

1.6.1 Research and Development 

INMOS has achieved technical success based on a position of innovation and leadership in products and 
process technology in conjunction with substantial research and development investment. This investment 
has averaged 18% of revenues since inception and it is anticipated that future investment will be increased. 

1.6.2 Process Developments 

One aspect of the work of the Technology Development Group at Newport is to scale the present 1 .2 micron 
technology to 1.0 micron for products to be manufactured in 1989/90. In addition, work is in progress on the 
development of 0.8 micron CMOS technology. 
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1.1 



Overview 



A transputer is a microcomputer with its own local memory and with 
another transputer. 



inks for connecting one transputer to 



The transputer architecture defines a family of programmable VLSI components. The definition of the ar- 
chitecture falls naturally into the logical aspects which define how a system of interconnected transputers is 
designed and programmed, and the physical aspects which refine how transputers, as VLSI components, 
are interconnected and controlled. 

A typical member of the transputer product family is a single chip containing processor, memory, and com- 
munication links which provide point to point connection between transputers. In addition, each transputer 
product contains special circuitry and interfaces adapting it to a particular use. For example, a peripheral 
control transputer, such as a graphics or disk controller, has interfaces tailored to the requirements of a 
specific device. 

A transputer can be used in a single processor system or in networks to build high performance concur- 
rent systems. A network of transputers and peripheral controllers is easily constructed using point-to-point 
communication. 




Figure 1.2 Transputer network 



Transputers and Occam 

Transputers can be programmed in most high level languages, and are designed to ensure that compiled 
programs will be efficient. Where it is required to exploit concurrency, but still to use standard languages, 
Occam can be used as a harness to link modules written in the selected languages. 

To gain most benefit from the transputer architecture, the whole system can be programmed in OCCam 
(pages 12, 29). This provides all the advantages of a high level language, the maximum program efficiency 
and the ability to use the special features of the transputer. 

Occam provides a framework for designing concurrent systems using transputers in just the same way 
that boolean algebra provides a framework for designing electronic systems from logic gates. The system 
designer's task is eased because of the architectural relationship between OCCam and the transputer. A 
program running in a transputer is formally equivalent to an OCCam process, so that a network of transputers 
can be described directly as an OCCam program. 
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Figure 1 .3 A node of four transputers 



1.2 



System design rationale 



The transputer architecture simplifies system design by the use of processes as standard software and 
hardware building blocks. 

An entire system can be designed and programmed in Occam, from system configuration down to low level 
I/O and real time interrupts. 



1.2.1 Programming 

The software building block is the process. A system is designed in terms of an interconnected set of 
processes. Each process can be regarded as an independent unit of design. It communicates with other 
processes along point-to-point channels. Its internal design is hidden, and it is completely specified by the 
messages it sends and receives. Communication between processes is synchronized, removing the need for 
any separate synchronisation mechanism. 

Internally, each process can be designed as a set of communicating processes. The system design is 
therefore hierarchically structured. At any level of design, the designer is concerned only with a small and 
manageable set of processes. 

Occam is based on these concepts, and provides the definition of the transputer architecture from the logical 
point of view (pages 12, 29). 
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1.2.2 Hardware 

Processes can be implemented in hardware. A transputer, executing an occam program, is a hardware 
process. The process can be independently designed and compiled. Its internal structure is hidden and it 
communicates and synchronizes with other transputers via its links, which implement OCCam channels. 

Other hardware implementations of the process are possible. For example, a transputer with a different 
instruction set may be used to provide a different cost/performance trade-off. Alternatively, an implementation 
of the process may be designed in terms of hard-wired logic for enhanced performance. 

The ability to specify a hard-wired function as an OCCam process provides the architectural framework for 
transputers with specialized capabilities (e.g., graphics). The required function (e.g., a graphics drawing and 
display engine) is defined as an OCCam process, and implemented in hardware with a standard OCCam 
channel interface. It can be simulated by an Occam implementation, which in turn can be used to test the 
application on a development system. 

1.2.3 Programmable components 

A transputer can be programmed to perform a specialized function, and be regarded as a 'black box' thereafter. 
Some processes can be hard-wired for enhanced performance. 

A system, perhaps constructed on a single chip, can be built from a combination of software processes, pre- 
programmed transputers and hardware processes. Such a system can, itself, be regarded as a component 
in a larger system. 

The architecture has been designed to permit a network of programmable components to have any desired 
topology, limited only by the number of links on each transputer. The architecture minimizes the constraints 
on the size of such a system, and the hierarchical structuring provided by OCCam simplifies the task of 
system design and programming. 

The result is to provide new orders of magnitude of performance for any given application, which can now 
exploit the concurrency provided by a large number of programmable components. 

1.3 Systems architecture rationale 

1.3.1 Point to point communication links 

The transputer architecture simplifies system design by using point to point communication links. Every 
member of the transputer family has one or more standard links, each of which can be connected to a link 
of some other component. This allows transputer networks of arbitrary size and topology to be constructed. 

Point to point communication links have many advantages over multi-processor buses: 

There is no contention for the communication mechanism, regardless of the number of 
transputers in the system. 

There is no capacitive load penalty as transputers are added to a system. 

The communications bandwidth does not saturate as the size of the system increases. 
Rather, the larger the number of transputers in the system, the higher the total communi- 
cations bandwidth of the system. However large the system, all the connections between 
transputers can be short and local. 
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1.3.2 Local memory 

Each transputer in a system uses its own local memory. Overall memory bandwidth is proportional to the 
number of transputers in the system, in contrast to a large global memory, where the additional processors 
must share the memory bandwidth. 

Because memory interfaces are not shared, and are separate from the communications interfaces, they can 
be individually optimized on different transputer products to provide high bandwidth with the minimum of 
external components. 



1.4 



Communication 



To provide synchronised communication, each message must be acknowledged. Consequently, a link requires 
at least one signal wire' in each direction. 
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Figure 1.4 Links communicating between processes 



A link between two transputers is implemented by connecting a link interface on one transputer to a link 
interface on the other transputer by two one-directional signal lines, along which data is transmitted serially. 



The two signal wires of the link can be used to provide two OCCam channels, one in each direction, 
requires a simple protocol. Each signal line carries data and control information. 



This 



The link protocol provides the synchronized communication of OCCam. The use of a protocol providing for 
the transmission of an arbitrary sequence of bytes allows transputers of different word length to be connected. 

Each message is transmitted as a sequence of single byte communications, requiring only the presence of 
a single byte buffer in the receiving transputer to ensure that no information is lost. Each byte is transmitted 
as a start bit followed by a one bit followed by the eight data bits followed by a stop bit. After transmitting a 
data byte, the sender waits until an acknowledge is received; this consists of a start bit followed by a zero 
bit. The acknowledge signifies both that a process was able to receive the acknowledged byte, and that the 
receiving link is able to receive another byte. The sending link reschedules the sending process only after 
the acknowledge for the final byte of the message has been received. 

Data bytes and acknowledges are multiplexed down each signal line. An acknowledge can be transmitted as 
soon as reception of a data byte starts (if there is room to buffer another one). Consequently transmission 
may be continuous, with no delays between data bytes. 
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Figure 1 .5 Link protocol 

The links are designed to make the engineering of transputer systems straightforward. Board layout of two 
wire connections is easy to design and area efficient. All transputers will support a standard communications 
frequency of 10 Mbits/sec, regardless of processor performance. Thus transputers of different performance 
can be directly connected and future transputer systems will directly communicate with those of today. 
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Figure 1.6 Clocking transputers 

Link communication is not sensitive to clock phase. Thus, communication can be achieved between inde- 
pendently clocked systems as long as the communications frequency is the same. 

The transputer family includes a number of link adaptor devices which provide a means of interfacing trans- 
puter links to non-transputer devices. 
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2 Occam model 

The programming model for transputers is defined by OCcam (page 29). The purpose of this section is to 
describe how to access and control the resources of transputers using Occam. A more detailed description 
is available in the OCCam programming manual and the transputer development system manual (provided 
with the development system). 

The transputer development system will enable transputers to be programmed in other industry standard 
languages. Where it is required to exploit concurrency, but still to use standard languages, Occam can be 
used as a harness to link modules written in the selected languages. 

2.1 Overview 

In Occam processes are connected to form concurrent systems. Each process can be regarded as a black 
box with internal state, which can communicate with other processes using point to point communication 
channels. Processes can be used to represent the behaviour of many things, for example, a logic gate, a 
microprocessor, a machine tool or an office. 

The processes themselves are finite. Each process starts, performs a number of actions and then terminates. 
An action may be a set of sequential processes performed one after another, as in a conventional programming 
language, or a set of parallel processes to be performed at the same time as one another. Since a process 
is itself composed of processes, some of which may be executed in parallel, a process may contain any 
amount of internal concurrency, and this may change with time as processes start and terminate. 

Ultimately, all processes are constructed from three primitive processes - assignment, input and output. An 
assignment computes the value of an expression and sets a variable to the value. Input and output are 
used for communicating between processes. A pair of concurrent processes communicate using a one way 
channel connecting the two processes. One process outputs a message to the channel and the other process 
inputs the message from the channel. 

The key concept is that communication is synchronized and unbuffered. If a channel is used for input 
in one process, and output in another, communication takes place when both processes are ready. The 
value to be output is copied from the outputting process to the inputting process, and the inputting and 
outputting processes then proceed. Thus communication between processes is like the handshake method 
of communication used in hardware systems. 

Since a process may have internal concurrency, it may have many input channels and output channels 
performing communication at the same time. 

Every transputer implements the Occam concepts of concurrency and communication. As a result, Occam 
can be used to program an individual transputer or to program a network of transputers. When OCCam is 
used to program an individual transputer, the transputer shares its time between the concurrent processes 
and channel communication is implemented by moving data within the memory. When Occam is used to 
program a network of transputers, each transputer executes the process allocated to it. Communication 
between Occam processes on different transputers is implemented directly by transputer links. Thus the 
same Occam program can be implemented on a variety of transputer configurations, with one configuration 
optimized for cost, another for performance, or another for an appropriate balance of cost and performance. 

The transputer and OCcam were designed together. All transputers include special instructions and hardware 
to provide maximum performance and optimal implementations of the Occam model of concurrency and 
communications. 

All transputer instruction sets are designed to enable simple, direct and efficient compilation of Occam. 
Programming of I/O, interrupts and timing is standard on all transputers and conforms to the occam model. 

Different transputer variants may have different instruction sets, depending on the desired balance of cost, 
performance, internal concurrency and special hardware. The OCCam level interface will, however, remain 
standard across all products. 



Occam model 
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Figure 2.1 Mapping processes onto one or several transputers 



2.2 
2.2.1 



Occam overview 
Processes 



After it starts execution, a process performs a number of actions, and then either stops or terminates. Each 
action may be an assignment, an input, or an output. An assignment changes the value of a variable, an 
Input receives a value from a channel, and an output sends a value to a channel. 

At any time between its start and termination, a process may be ready to communicate on one or more of 
its channels. Each channel provides a one way connection between two concurrent processes; one of the 
processes may only output to the channel, and the other may only input from it. 

Assignment 

An assignment is indicated by the symbol :=. The example 

V := e 

sets the value of the variable v to the value of the expression e and then terminates, for example: 
X := sets X to zero, and x := x + 1 increases the value of x by 1 . 

Input 

An input is indicated by the symbol ? The example 
c ? X 

inputs a value from the channel c, assigns it to the variable x and then terminates. 

Output 

An output is indicated by the symbol ! The example 

c ! e 
outputs the value of the expression e to the channel c. 



14 2 transputer architecture 



2.2.2 Constructions 

A number of processes can be combined to form a construct. A construct is itself a process and can therefore 
be used as a component of another construct. Each component process of a construct is written two spaces 
further from the left hand margin, to indicate that it is part of the construct. There are four classes of constructs 
namely the sequential, parallel, conditional and the alternative construct. 

Sequence 

A sequential construct is represented by 

SEQ 
PI 
P2 
P3 

The component processes PI, P2, P3 ... are executed one after another. Each component process starts 
after the previous one terminates and the construct terminates after the last component process terminates. 
For example 

SEQ 

cl ? X 

X := X + 1 

c2 ! X 

inputs a value, adds one to it, and then outputs the result. 

Sequential constructs in occam are similar to programs written in conventional programming languages. 
Note, however, that they provide the performance and efficiency equivalent to that of an assembler for a 
conventional microprocessor. 

Parallel 

A parallel construct is represented by 

PAR 
PI 
P2 
P3 

The component processes PI, P2, P3 ... are executed together, and are called concurrent processes. The 
construct terminates after all of the component processes have terminated, for example: 

PAR 

cl ? X 
c2 ! y 

allows the communications on channels cl and c2 to take place together. 

The parallel construct is unique to Occam. It provides a straightforward way of writing programs which directly 
reflects the concurrency inherent in real systems. The implementation of parallelism on a single transputer 
is highly optimized so as to incur minimal process scheduling overhead. 
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Communication 

Concurrent processes communicate only by using channels, and communication is synchronized. If a channel 
is used for input in one process, and output in another, communication takes place when both the inputting 
and the outputting processes are ready. The value to be output is copied from the outputting process to the 
inputting process, and the processes then proceed. 

Communication between processes on a single transputer is via memory-to-memory data transfer. Between 
processes on different transputers it is via standard links. In either case the OCCam program is identical. 

Conditional 

A conditional construct 

IF 

conditionl 

PI 
conclition2 

P2 

means that PI is executed if conditionl is true, otherwise P2 is executed if condition2 is true, and 
so on. Only one of the processes is executed, and then the construct terminates, for example: 

IF 

X = 

y := y + 1 
X <> 

SKIP 

increases y only if the value of x is 0. 
Alternation 

An alternative construct 

ALT 

input 1 

PI 
input 2 

P2 
inputs 

P3 

waits until one of inputl, input2, inputs ... is ready. If inputl first becomes ready, inputl 
is performed, and then process PI is executed. Similarly, if input2 first becomes ready, input2 is 
performed, and then process P2 is executed. Only one of the inputs is performed, then its corresponding 
process is executed and then the construct terminates, for example: 

ALT 

count ? signal 

counter := counter + 1 
total ? signal 
SEQ 

out ! counter 
counter := 

either inputs a signal from the channel count, and increases the variable counter by 1, or alternatively 
inputs from the channel total, outputs the current value of the counter, then resets it to zero. 
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The ALT construct provides a formal language method of handling external and internal events that must be 
handled by assembly level interrupt programming in conventional microprocessors. 

Loop 

WHILE condition 
P 

repeatedly executes the process P until the value of the condition is false, for example: 

WHILE (X - 5) > 
X := X - 5 

leaves x holding the value of (x remainder 5) if x were positive. 

Selection 

A selection construct 

CASE s 
n 

PI 
m,q 

P2 

means that PI is executed if s has the same value as n, otherwise P2 is executed if s has the same value 
as m or q, and so on, for example: 

CASE direction 
up 

X := X + 1 
down 

X := X - 1 

increases the value of x if direction is equal to up, otherwise if direction is equal to down the value 
of X is decreased. 

Replication 

A replicator is used with a SEQ, PAR, IF or ALT construction to replicate the component process a number 
of times. For example, a replicator can be used with SEQ to provide a conventional loop. 

SEQ i = FOR n 
P 

causes thie process P to be executed n times. 

A replicator may be used with PAR to construct an array of concurrent processes. 

PAR i = FOR n 
Pi 

constructs an array of n similar processes PO, PI, ..., Pn-1. The index i takes the values 0, 1, ..., n-1, in 
PO, PI, ..., Pn-1 respectively. 



Occam model 
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2.2.3 



Types 



Every variable, expression and value has a type, which may be a primitive type, array type, record type or 
variant type. The type defines the length and interpretation of data. 

AH implementations provide the primitive types shown in table 2.1. 



Table 2.1 Types 



CHAN OF protocol 

TIMER 

BOOL 
BYTE 

INT 
INT16 
INT32 
INT64 
REALS 2 

REAL64 



Each communication channel provides communication between 
two concurrent processes. Each channel is of a type which 
allows communication of data according to the specified protocol. 

Each timer provides a clock which can be used by any number 
of concurrent processes. 

The values of type BOOL are true and false. 

The values of type BYTE are unsigned numbers n 
in the range <=n< 256. 

Signed integers n in the range -2^^ <=n< 2^\ 

Signed integers n in the range -2^^ <=n< 2^^. 

Signed integers n in the range -2^^ <=n< 2^\ 

Signed integers n in the range -2^^ <=n< 2^^. 

Floating point numbers stored using a sign bit, 8 bit exponent and 

23 bit fraction in ANSI/IEEE Standard 754-1985 representation.^ 

Floating point numbers stored using a sign bit, 1 1 bit exponent and 
52 bit fraction in ANSI/IEEE Standard 754-1985 representation.^ 



2.2.4 Declarations, arrays and subscripts 

A declaration T x declares x as a new channel, variable, timer or array of type T, for example: 

INT x: 

P 

declares x as an integer variable for use in process P. 

Array types are constructed from component types. For example [ n ] T is an array type constructed from 
n components of type T. 

A component of an array may be selected by subscription, for example v[e] selects the e'th component of 

V. 

Asetof components of an array may be selected by subscription, for example [v FROM e FOR c] selects 
the c components v[e] , v[e + 1] , ... v[e + c - 1] . A set of components of an array may 
be assigned, input or output. 



"" IEEE Standard for Binary Floating-Point arithmetic 
ANSI/IEEE Std 754-1985 
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2.2.5 Procedures 

A process may be given a name, for example: 

PROC square (INT n) 
n := n * n 



defines the procedure square. The name may be used as an instance of the process, for example: 
square (x) 

is equivalent to 



n IS X 
n := n 



= n * 



2.2.6 Functions 

A function can be defined in the same way as a procedure. For example: 

INT FUNCTION factorial (VAL INT n) 
INT product : 
VALOF 
IF 

n >= 
SEQ 

product := 1 
SEQ i = 1 FOR n 

product := product * i 
RESULT product 

defines the function factorial, which may appear in expressions such as 
m := factorial (6) 

2.2.7 Expressions 

An expression is constructed from the operators given in table 2.2, from variables, numbers, the truth values 
TRUE and FALSE, and the brackets ( and ) . 



Table 2.2 Operators 



Operator 


Operand types 


Description 


+ - * / REM 


integer, real 


arithmetic operators 




PLUS MINUS TIMES AFTER 


integer 


modulo arithmetic 




= <> 


any primitive 


relational operators 




><>=<= 


integer, real 


relational operators 




AND OR NOT 


boolean 


boolean operators 




A \/ X - 


integers 


bitwise operators: and, or, xor, not 


« » 


integer 


shift operators 
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For example, the expression 

(5 + 7) / 2 
evaluates to 6, and the expression 

(#1DF /\ #F0) » 4 

evaluates to #D (the character # introduces a hexadecimal constant). 

A string is represented as a sequence of ASCII characters, enclosed in double quotation marks ". If the 
string has n characters, then it is an array of type [n]BYTE. 

2.2.8 Timer 

All transputers incorporate a timer. The implementation directly supports the OCCam model of time. Each 
process can have its own independent timer, which can be used for internal measurement or for real time 
scheduling. 

A timer input sets a variable to a value of type INT representing the time. The value is derived from a clock, 
which changes at regular intervals, for example: 

tim ? V 

sets the variable v to the current value of a free running clock, declared as the timer tim. 

A delayed input takes the following form 

tim ? AFTER e 

A delayed input is unable to proceed until the value of the timer satisfies {timer AFTER e). The comparison 
performed is a modulo comparison. This provides the effect that, starting at any point in the timer's cycle, 
the previous half cycle of the timer is considered as being before the current time, and the next half cycle is 
considered as being after the current time. 

2.2.9 Peripheral access 

The implementation of OCCam provides for peripheral access by extending the input and output primitives 
with a port input/output mechanism. A port is used like an occam channel, but has the effect of transferring 
information to and from a block of addresses associated with a peripheral. 

Ports behave like OCCam channels in that only one process may input from a port, and only one process 
may output to a port. Thus ports provide a secure method of accessing external memory mapped status 
registers etc. 

Note that there is no synchronization mechanism associated with port input and output. Any timing constraints 
which result from the use of asynchronous external hardware will have to be programmed explicitly. For 
example, a value read by a port input may depend upon the time at which the input was executed, and 
inputting at an invalid time would produce unusable data. 

During applications development it is recommended that the peripheral is modelled by an OCCam process 
connected via channels. 
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2.3 Configuration 

Occam programs may be configured for execution on one or many transputers. The transputer development 
system provides the necessary tools for correctly distributing a program configured for many transputers. 

Configuration does not affect the logical behaviour of a program (see section four, Program development). 
However, it does enable the program to be arranged to ensure that performance requirements are met. 

PLACED PAR 

A parallel construct may be configured for a network of transputers by using the PLACED PAR construct. 
Each component process (termed a placement) is executed by a separate transputer. The variables and 
timers used in a placement must be declared within each placement process. 

PRI PAR 

On any individual transputer, the outermost parallel construct may be configured to prioritize its components. 
Each process is executed at a separate priority. The first process has the highest priority, the last process 
has the lowest priority. Lower priority components may only proceed when all higher priority components are 
unable to proceed. 

2.3.1 INMOS standard links 

Each link provides one channel in each direction between two transputers. 

A channel (which must already have been declared) is associated with a link by a channel association, for 
example: 

PLACE LinkOInput AT 4 : 
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3 Error handling 

Errors in OCCam programs are either detected by the compiler or can be handled at runtime in one of three 
ways. 

1 Cause the process to STOP allowing other processes to continue. 

2 Cause the whole system to halt. 

3 Have an arbitrary (undefined) effect. 

The Occam process stop starts but never terminates. In method 1 , an errant process stops and in particular 
cannot communicate erroneous data to other processes. Other processes will continue to execute until they 
become dependent on data from the stopped process. It is therefore possible, for example, to write a 
process which uses a timeout to warn of a stopped process, or to construct a redundant system in which 
several processes performing the same task are used to enable the system to continue after one of them 
has failed. 

Method 1 is the preferred method of executing a program. 

Method 2 is useful for program development and can be used to bring transputers to an immediate halt, 
preventing execution of further instructions. The transputer Error output can be used to inform the transputer 
development system that such an error has occurred. No variable local to the process can be overwritten 
with erroneous data, facilitating analysis of the program and data which gave rise to the error. 

Method 3 is useful only for optimising programs which are known to be correct! 

When a system has stopped or halted as a result of an error, the state of all transputers in the system can 
be analysed using the transputer development system. 

For languages other than Occam, the transputer provides facilities for handling individual errors by software. 
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4 Program development 

The development of programs for multiple processor systems can involve experimentation. In some cases, 
tlie most effective configuration is not always clear until a substantial amount of work has been done. For 
this reason, it is desirable that most of the design and programming can be completed before hardware 
construction is started. 

4.1 Logical behaviour 

An important property of OCCam in this context is that it provides a clear notion of 'logical behaviour'; this 
relates to those aspects of a program not affected by real time effects. 

It is guaranteed that the logical behaviour of a program is not altered by the way in which the processes 
are mapped onto processors, or by the speed of processing and communication. Consequently a program 
ultimately intended for a network of transputers can be compiled, executed and tested on a single computer 
used for program development. 

Even if the application uses only a single transputer, the program can be designed as a set of concurrent 
processes which could run on a number of transputers. This design style follows the best traditions of 
structured programming; the processes operate completely independently on their own variables except 
where they explicitly interact, via channels. The set of concurrent processes can run on a single transputer 
or, for a higher performance product, the processes can be partitioned amongst a number of transputers. 

It is necessary to ensure, on the development system, that the logical behaviour satisfies the application 
requirements. The only ways in which one execution of a program can differ from another in functional 
terms result from dependencies upon input data and the selection of components of an ALT. Thus a simple 
method of ensuring that the application can be distributed to achieve any desired performance is to design 
the program to behave 'correctly' regardless of input data and ALT selection. 

4.2 Performance measurement 

Performance information is useful to gauge overall throughput of an application, and has to be considered 
carefully in applications with real time constraints. 

Prior to running in the target environment, an occam program should be relatively mature, and indeed should 
be correct except for interactions which do not obey the Occam synchronization rules. These are precisely 
the external interactions of the program where the world will not wait to communicate with an OCcam process 
which is not ready. Thus the set of interactions that need to be tested within the target environment are well 
identified. 

Because, in occam, every program is a process, it is extremely easy to add monitor processes or simulation 
processes to represent parts of the real time environment, and then to simulate and monitor the anticipated 
real time interactions. The Occam concept of time and its implementation in the transputer is important. 
Every process can have an independent timer enabling, for example, all the real time interactions to be 
modelled by separate processes and any time dependent features to be simulated. 

4.3 Separate compilation of OCCam and other languages 

A program portion which is separately compiled, and possibly written in a language other than Occam, may 
be executed on a single transputer. 

If the program is written in OCCam, then it takes the form of a single PROG, with only channel parameters. If 
the program is written in a language other than OCCam, then a run-time system is provided which provides 
input/output to Occam channels. 
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Such separately compiled program portions are linked together by a framework of channels, termed a harness. 
The harness is written in OCCam. It includes all configuration information, and in particular specifies the 
transputer configuration in which the separately compiled program portion is executed. 

Transputers are designed to allow efficient implementations of high level languages, such as C, Pascal and 
Fortran. Such languages will be available in addition to OCCam. 

At runtime, a program written in such a language is treated as a single OCCam process. Facilities are 
provided in the implementations of these languages to allow such a program to communicate on Occam 
channels. It can thus communicate with other such programs, or with programs written in Occam. These 
programs may reside on the same transputer, in which case the channels are implemented in store, or may 
reside on different transputers, in which case the channels are implemented by transputer links. 

It is therefore possible to implement Occam processes in conventional high level languages, and arrange for 
them to communicate. It is possible for different parts of the same application to be implemented in different 
high level languages. 

The standard input and output facilities provided within these languages are implemented by a well-defined 
protocol of communications on OCCam channels. 

The development system provides facilities for management of separately compiled OCCam. 

4.4 Memory map and placement 

The low level memory model is of a signed address space. 

Memory is byte addressed, the lowest addressed byte occupying the least significant byte position within the 
word. 

The implementation of OCCam supports the allocation of the code and data areas of an occam process to 
specific areas of memory. Such a process must be a separately compiled PROG, and must not reference any 
variables and timers other than those declared within it. 
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5 Physical architecture 

5.1 INMOS serial links 

5.1.1 Overview 

All transputers have several links. The link protocol and electrical characteristics form a standard for all 
INMOS transputer and peripheral products. 

All transputers support a standard link communications frequency of 10 Mbits/sec. Some devices also support 
other data rates. Maintaining a standard communications frequency means that devices of mixed performance 
and type can intercommunicate easily. 

Each link consists of two unidirectional signal wires carrying both data and control bits. The link signals are 
TTL compatible so that their range can be easily extended by inserting buffers. 

The INMOS communication links provide for communication between devices on the same printed circuit 
board or between printed circuit boards via a back plane. They are intended to be used in electrically quiet 
environments in the same way as logic signals between TTL gates. 

The number of links, and any communication speeds in addition to the standard speed of 10 Mbits/sec, are 
given in the product data for each product. 

5.1.2 Link electrical specification 

The quiescent state of the link signals is low, for a zero. The link input signals and output signals are standard 
TTL compatible signals. 

For correct functioning of the links the specifications for maximum variation in clock frequency between two 
transputers joined by a link and maximum capacitive load must be met. Each transputer product also has 
specified the maximum permissible variation in delay in buffering, and minimum permissible edge gradients. 
Details of these specifications are provided in the product data. 

Provided that these specifications are met then any buffering employed may introduce an arbitrary delay into 
a link signal without affecting its correct operation. 

5.2 System services 

5.2.1 Powering up and down, running and stopping 

At all times the specification of input voltages with respect to the GND and VCC pins must be met. This 
includes the times when the VCC pins are ramping to 5 V, and also while they are ramping from 5 V down 
too V. 

The system services comprise the clocks, power, and signals used for initialization. 

The specification includes minimum times that VCC must be within specification, the input clock must be 
oscillating, and the Reset signal must be high before Reset goes low. These specifications ensure that 
internal clocks and logic have settled before the transputer starts. 

When the transputer is reset the memory interface is initialised (if present and configurable). 

The processor and INMOS serial links start after reset. The transputer obeys a bootstrap program which 
can either be in off-chip ROM or can be received from one of the links. How to specify where the bootstrap 
program is taken from depends upon the type of transputer being used. The program will normally load up 
a larger program either from ROM or from a peripheral such as a disk. 

During power down, as during power up, the input and output pins must remain within specification with 
respect to both GND and VCC. 
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A software error, such as arithmetic overflow, array bounds violation or divide by zero, causes an error flag to 
be set in the transputer processor. The flag is directly connected to the Error pin. Both the flag and the pin 
can be ignored, or the transputer stopped. Stopping the transputer on an error means that the error cannot 
cause further corruption. 

As well as containing the error in this way it is possible to determine the state of the transputer and its memory 
at the time the error occurred. 

5.2.2 Clock distribution 

All transputers operate from a standard 5MHz input clock. High speed clocks are derived internally from the 
low frequency input to avoid the problems of distributing high frequency clocks. Within limits the mark-to- 
space ratio, the voltage levels and the transition times are immaterial. The limits on these are given in the 
product data for each product. The asynchronous data reception of the links means that differences in the 
clock phase between chips is unimportant. 

The important characteristic of the transputer's input clock is its stability, such as is provided by a crystal 
oscillator. An R-C oscillator is inadequate. The edges of the clock should be monotonic (without kinks), and 
should not undershoot below -0.5 V. 

5.3 Bootstrapping from ROM or from a link 

The program which is executed after reset can either reside in ROM in the transputer's address space or it 
can be loaded via any one of the transputer's INMOS serial links. 

The transputer bootstraps from ROM by transferring control to the top two bytes in memory, which will 
invariably contain a backward jump into ROM. 

If bootstrapping from a link, the transputer bootstraps from the first link to receive a message. The first byte 
of the message is the count of the number of bytes of program which follow. The program is loaded into 
memory starting at a product dependent location MemStart, and then control is transferred to this address. 

Messages subsequently arriving on other links are not acknowledged until the transputer processor obeys 
a process which inputs from them. The loading of a network of transputers is controlled by the transputer 
development system, which ensures that the first message each transputer receives is the bootstrap program. 

5.4 Peripheral interfacing 

All transputers contain one or more INMOS serial links. Certain transputer products also have other application 
specific interfaces. The peripheral control transputers contain specialized interfaces to control a specific 
peripheral or peripheral family. 

In general, a transputer based application will comprise a number of transputers which communicate using 
INMOS links. There are three methods of communicating with peripherals. 

The first is by employing peripheral control transputers (eg for graphics or disks), in which the transputer chip 
connects directly to the peripheral concerned (figure 5.1). The interface to the peripheral is implemented by 
special purpose hardware within the transputer. The application software in the transputer is implemented 
as an OCcam process, and controls the interface via Occam channels linking the processor to the special 
purpose hardware. 

The second method is by employing link adaptors (figure 5.2). These devices convert between a link and a 
specialized interface. The link adaptor is connected to the link of an appropriate transputer, which contains 
the application designer's peripheral device handler implemented as an occam process. 

The third method is by memory mapping the peripheral onto the memory bus of a transputer (figure 5.3). 
The peripheral is controlled by memory accesses issued as a result of PORT inputs and outputs. The 
application designer's peripheral device handler provides a standard Occam channel interface to the rest of 
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the application. 

The first transputers implement an event pin wliich provides a simple means for an external peripheral to 
request attention from a transputer. 

In all three methods, the peripheral driver interfaces to the rest of the application via OCCam channels. 
Consequently, a peripheral device can be simulated by an occam process. This enables testing of all 
aspects of a transputer system before the construction of hardware. 
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Figure 5.1 Transputer with peripheral control transputers 
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Figure 5.2 Transputer with link adaptors 
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1 Introduction 

The INMOS transputer family is a range of system components each of which combines processing, memory 
and interconnect in a single VLSI chip. A concurrent system can be constructed from a collection of transputers 
which operate concurrently and communicate through serial communication links. Such systems can be 
designed and programmed in occam, a language based on communicating processes, and in other industry 
standard languages. Transputers have been sucessfully used in application areas ranging from embedded 
systems to supercomputers. 

The first member of the family, the IMS T414 32-bit transputer, was introduced in September 1985, and has 
enabled concurrency to be applied in a wide variety of applications such as simulation, robot control, image 
synthesis, and digital signal processing. Many computationally intensive applications can exploit large arrays 
of transputers; the system performance depending on the number of transputers, the speed of inter-transputer 
communication and the performance of each transputer processor. 

The power of transputer based systems lies in the smoothly scaleable performance offered by adding more 
transputers. The transputer embodies the concepts required for effective parallel processing. 

Further transputer products are continually being developed which increase the memory, processing perfor- 
mance and communications performance. An important example is the floating point transputer first intro- 
duced in 1987. 
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2 The transputer: basic architecture and concepts 

2.1 A programmable device 

The transputer is a component designed to exploit the potential of VLSI. This technology allows large numbers 
of identical devices to be manufactured cheaply. For this reason, it is attractive to implement a concurrent 
system using a number of identical components, each of which is customised by an appropriate program. 
The transputer is, therefore, a VLSI device with a processor, memory to store the program executed by 
the processor, and communication links for direct connection to other transputers. Transputer systems can 
be designed and programmed using Occam which allows an application to be described as a collection of 
processes which operate concurrently and communicate through channels. The transputer can therefore be 
used as a building block for concurrent processing systems, with Occam as the associated design formalism. 

2.2 Occam 

Occam enables a system to be described as a collection of concurrent processes, which communicate with 
each other and with peripheral devices through channels. OCCam programs are built from three primitive 
processes: 

V := e assign expression e to variable v 

c ! e output expression e to channel c 

c ? V input from channel c to variable v 

The primitive processes are combined to form constructs: 

SEQuential components executed one after another 

PARallel components executed together 

ALTernative component first ready is executed 

A construct is itself a process, and may be used as a component of another construct. 

Conventional sequential programs can be expressed with variables and assignments, combined in sequential 
constructs. IF and WHILE constructs are also provided. 

Concurrent programs can be expressed with channels, inputs and outputs, which are combined in parallel 
and alternative constructs. 

Each Occam channel provides a communication path between two concurrent processes. Communication 
is synchronised and takes place when both the inputting process and the outputting process are ready. The 
data to be output is then copied from the outputting process to the inputting process, and both processes 
continue. 

An alternative process may be ready for input from any one of a number of channels. In this case, the input 
is taken from the channel which is first used for output by another process. 

2.3 VLSI technology 

One important property of VLSI technology is that communication between devices is very much slower than 
communication within a device. In a computer, almost every operation that the processor performs involves 
the use of memory. For this reason a transputer includes both processor and memory in the same integrated 
circuit device. 
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In any system constructed from integrated circuit devices, much of the physical bulk arises from connections 
between devices. The size of the package for an integrated circuit is determined more by the number of 
connection pins than by the size of the device itself. In addition, connections between devices provided by 
paths on a circuit board consume a considerable amount of space. 

The speed of communication between electronic devices is optimised by the use of one-directional signal 
wires, each connecting two devices. If many devices are connected by a shared bus, electrical problems of 
driving the bus require that the speed is reduced. Also, additional control logic and wiring are required to 
control sharing of the bus. 

To provide maximum speed with minimal wiring, the transputer uses point-to-point serial communication links 
for direct connection to other transputers. The protocols used on the transputer links are discussed later. 

2.4 Simplified processor with micro-coded scheduler 

The most effective implementation of simple programs by a programmable computer is provided by a se- 
quential processor. Consequently, the transputer has a fairly conventional microcoded processor. There is 
a small core of about 32 instructions which are used to implement simple sequential programs. In addition 
there are other, more specialised groups of instructions which provide facilities such as long arithmetic and 
process scheduling. 

As a process executed by a transputer may itself consist of a number of concurrent processes the transputer 
has to support the OCCam programming model internally. The transputer, therefore, has a microcoded 
scheduler which shares the processor time between the concurrent processes. The scheduler provides two 
priority levels; any high priority process which can run will do so in preference to any low priority process. 
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3 Transputer internal architecture 

Internally, a transputer consists of a memory, processor and communications system connected via a 32-bit 
bus. The bus also connects to the external memory interface, enabling additional local memory to be used. 
The processor, memory and communications system each occupy about 25% of the total silicon area, the 
remainder being used for power distribution, clock generators and external connections. 

The floating point transputers each have an on-chip floating point unit. The small size and high performance 
of this unit come from a design which takes careful note of silicon economics. This contrasts starkly with 
conventional co-processors, where the floating point unit typically occupies more area than a complete micro- 
processor, and requires a second chip. 

The block diagram 3.1 indicates the way in which the major blocks of the transputer are interconnected. 
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Figure 3.1 Transputer interconnections 

The CPU of the transputers contains three registers (A, B and C) used for integer and address arithmetic, 
which form a hardware stack. Loading a value into the stack pushes B into C, and A into B, before loading A. 
Storing a value from A pops B into A and C into B. Similarly, the FPU includes a three register floating-point 
evaluation stack, containing the AF, BF, and CF registers. When values are loaded onto, or stored from the 
stack the AF, BF and CF registers push and pop in the same way as the A, B and C registers. 

The addresses of floating point values are formed on the CPU stack, and values are transferred between the 
addressed memory locations and the FPU stack under the control of the CPU. As the CPU stack is used only 
to hold the addresses of floating point values, the wordlength of the CPU is independent of that of the FPU. 
Consequently, it would be possible to use the same FPU together with a 16-bit CPU. 

The transputer scheduler provides two priority levels. The FPU register stack is duplicated so that when the 
floating point transputer switches from low to high priority none of the state in the floating point unit is written 
to memory. This results in a worst-case interrupt response of about 3 )us. Furthermore, the duplication of 
the register stack enables floating point arithmetic to be used in an interrupt routine without any performance 
penalty. 
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3 transputer overview 



3.1 



Sequential processing 



The design of the transputer processor exploits the availability of fast on-chip memory by having only a small 
number of registers; the CPU contains six registers which are used in the execution of a sequential process. 
The small number of registers, together with the simplicity of the instruction set enables the processor to have 
relatively simple (and fast) data-paths and control logic. 

The six registers are: 

The workspace pointer which points to an area of store where local variables are kept. 

The instruction pointer which points to the next instruction to be executed. 

The operand register which is used in the formation of instruction operands. 

The A, B and C registers which form an evaluation stack, and are the sources and destinations for most 
arithmetic and logical operations. Loading a value into the stack pushes B into C, and A into B, before 
loading A. Storing a value from A, pops B into A and C into B. 
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Figure 3.2 Registers 

Expressions are evaluated on the evaluation stack, and instructions refer to the stack implicitly. For example, 
the acfGf instruction adds the top two values in the stack and places the result on the top of the stack. The use of 
a stack removes the need for instructions to respecify the location of their operands. Statistics gathered from a 
large number of programs show that three registers provide an effective balance between code compactness 
and implementation complexity. 

No hardware mechanism is provided to detect that more than three values have been loaded onto the stack. 
It is easy for the compiler to ensure that this never happens. 



3.2 



Instructions 



It was a design decision that the transputer should be programmed in a high-level language. The instruction 
set has, therefore, been designed for simple and efficient compilation. It contains a relatively small number 
of instructions, all with the same format, chosen to give a compact representation of the operations most 
frequently occuring in programs. The instruction set is independant of the processor wordlength, allowing the 
same microcode to be used for transputers with different wordlengths. Each instruction consists of a single 
byte divided into two 4-bit parts. The four most significant bits of the byte are a function code, and the four 
least significant bits are a data value. 
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Figure 3.3 Instruction format 



3.2.1 



Direct functions 



The representation provides for sixteen functions, each with a data value ranging from to 15. Thirteen of 
these are used to encode the most important functions performed by any computer. These include: 



load constant 
load local 
load non-local 
jump 



add constant 
store local 
store non-local 
conditional jump 



load local pointer 
call 



The most common operations in a program are the loading of small literal values, and the loading and storing 
of one of a small number of variables. The load constant instruction enables values between and 15 to be 
loaded with a single byte instruction. The load local and store local instructions access locations in memory 
relative to the workspace pointer. The first 16 locations can be accessed using a single byte instruction. 

The load non-local and store non-local instructions behave similarly, except that they access locations in 
memory relative to the A register. Compact sequences of these instructions allow efficient access to data 
structures, and provide for simple implementations of the static links or displays used in the implementation 
of block structured programming languages such as OCCam. 

3.2.2 Prefix functions 

Two more of the function codes are used to allow the operand of any instruction to be extended in length. 
These are: 



prefix 



negative prefix 



All instructions are executed by loading the four data bits into the least significant four bits of the operand 
register, which is then used as the the instruction's operand. All instructions except the prefix instructions 
end by clearing the operand register, ready for the next instruction. 
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Figure 3.4 Instruction operand register 

The prefix instruction loads its four data bits into the operand register, and then shifts the operand register up 
four places. The negative prefix instruction is similar, except that it complements the operand register before 
shifting it up. Consequently operands can be extended to any length up to the length of the operand register 
by a sequence of prefix instructions. In particular, operands in the range -256 to 255 can be represented 
using one prefix instruction. 
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The use of prefix instructions has certain beneficial consequences. Firstly, they are decoded and executed 
in the same way as every other instruction, which simplifies and speeds instruction decoding. Secondly, they 
simplify language compilation, by providing a completely uniform way of allowing any instruction to take an 
operand of any size. Thirdly, they allow operands to be represented in a form independent of the processor 
wordlength. 

3.2.3 Indirect functions 

The remaining function code, operate, causes its operand to be interpreted as an operation on the values 
held in the evaluation stack. This allows up to 16 such operations to be encoded in a single byte instruction. 
However, the prefix instructions can be used to extend the operand of an operate instruction just like any 
other. The instruction representation therefore provides for an indefinite number of operations. 

The encoding of the indirect functions is chosen so that the most frequently occuring operations are repre- 
sented without the use of a prefix instruction. These include arithmetic, logical and comparison operations 
such as 

add exclusive or greater than 

Less frequently occuring operations have encodings which require a single prefix operation (the transputer 
instruction set is not large enough to require more than 512 operations to be encoded!). 

The IMS T800 has additional instructions which load into, operate on, and store from, the floating point 
register stack. It also contains new instructions which support colour graphics, pattern recognition and the 
implementation of error correcting codes. These instructions have been added whilst retaining the existing 
IMS T414 instruction set. This has been possible because of the extensible instruction encoding used in 
transputers. 

3.2.4 Efficiency of encoding 

Measurements show that about 70% of executed instructions are encoded in a single byte (ie without the use 
of prefix instructions). Many of these instructions, such as load constant and add require just one processor 
cycle. 

The instruction representation gives a more compact representation of high level language programs than 
more conventional instruction sets. Since a program requires less store to represent it, less of the memory 
bandwidth is taken up with fetching instructions. Furthermore, as memory is word accessed the processor 
will receive several instructions for every fetch. 

Short instructions also improve the effectiveness of instruction prefetch, which in turn improves processor 
performance. There is an extra word of prefetch buffer so that the processor rarely has to wait for an instruction 
fetch before proceeding. Since the buffer is short, there is little time penalty when a jump instruction causes 
the buffer contents to be discarded. 

3.3 Support for concurrency 

The processor provides efficient support for the Occam model of concurrency and communication. It has a 
microcoded scheduler which enables any number of concurrent processes to be executed together, sharing 
the processor time. This removes the need for a software kernel. The processor does not need to support the 
dynamic allocation of storage as the Occam compiler is able to perform the allocation of space to concurrent 
processes. 
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At any time, a concurrent process may be 



active 



inactive 



being executed 

on a list waiting to be executed 

ready to input 

ready to output 

waiting until a specified time 



The scheduler operates in such a way that inactive processes do not consume any processor time. The active 
processes waiting to be executed are held on a list. This is a linked list of process workspaces, implemented 
using two registers, one of which points to the first process on the list, the other to the last. In figure 3.5, S is 
executing, and P, Q and R are active, awaiting execution. 
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Figure 3.5 Linked process list 

A process is executed until it is unable to proceed because it is waiting to input or output, or waiting for the 
timer. Whenever a process is unable to proceed, its instruction pointer is saved in its workspace and the next 
process is taken from the list. Actual process switch times are very small as little state needs to be saved; it 
is not necessary to save the evaluation stack on rescheduling. 

The processor provides a number of special operations to support the process model. These include 



start process 



end process 



When a parallel construct is executed, start process instructions are used to create the necessary concurrent 
processes. A start process instruction creates a new process by adding a new workspace to the end of the 
scheduling list, enabling the new concurrent process to be executed together with the ones already being 
executed. 



The correct termination of a parallel construct is assured by use of the end process instruction. This uses 
a workspace location as a counter of the components of the parallel construct which have still to terminate. 
The counter is initialised to the number of components before the processes are 'started'. Each component 
ends with an end process instruction which decrements and tests the counter. For all but the last component, 
the counter is non zero and the component is descheduled. For the last component, the counter is zero and 
the component continues. 
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3 transputer overview 



3.4 



Communications 



Communication between processes is achieved by means of channels. Occam communication is point-to- 
point, synchronised and unbuffered. As a result, a channel needs no process queue, no message queue and 
no message buffer. 

A channel between two processes executing on the same transputer is implemented by a single word in 
memory; a channel between processes executing on different transputers is implemented by point-to-point 
links. The processor provides a number of operations to support message passing, the most important being 

input message output message 

The input message and output message instructions use the address of the channel to determine whether 
the channel is internal or external. This means that the same instruction sequence can be used for both hard 
and soft channels, allowing a process to be written and compiled without knowledge of where its channels 
are connected. 

As in the occam model, communication takes place when both the inputting and outputting processes are 
ready. Consequently, the process which first becomes ready must wait until the second one is also ready. 

A process performs an input or output by loading the evaluation stack with a pointer to a message, the 
address of a channel, and a count of the number of bytes to be transferred, and then executing an input 
message or an output message instruction. 



3.4.1 



Internal channel communication 



At any time, an internal channel (a single word in memory) either holds the identity of a process, or holds the 
special value empty. The channel is initialised to emp/y before it is used. 

When a message is passed using the channel, the identity of the first process to become ready is stored 
in the channel, and the processor starts to execute the next process from the scheduling list. When the 
second process to use the channel becomes ready, the message is copied, the waiting process is added to 
the scheduling list, and the channel reset to its initial state. It does not matter whether the inputting or the 
outputting process becomes ready first. 

In figure 3.6, a process P is about to execute an output instruction on an 'empty' channel C. The evaluation 
stack holds a pointer to a message, the address of channel C, and a count of the number of bytes in the 
message. 
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Figure 3.6 Output to empty channel 
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After executing the output instruction, the channel C holds the address of the workspace of P, and the address 
of the message to be transferred is stored in the workspace of P. P is descheduled, and the process starts 
to execute the next process from the scheduling list. 
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Figure 3.7 

The channel C and the process P remain in this state until a second process, Q executes an output instruction 
on the channel. 
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Figure 3.8 

The message is copied, the waiting process P is added to the scheduling list, and the channel C is reset to 
its initial 'empty' state. 
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Figure 3.9 
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3 transputer overview 



3.4.2 External channel communication 

When a message is passed via an external channel the processor delegates to an autonomous link interface 
the job of transferring the message and deschedules the process. When the message has been transferred 
the link interface causes the processor to reschedule the waiting process. This allows the processor to 
continue the execution of other processes whilst the external message transfer is taking place. 

Each link interface uses three registers: 

a pointer to a process workspace 

a pointer to a message 

a count of bytes in the message 



In figure 3.10 processes P and Q executed by different transputers communicate using a channel C imple- 
mented by a link connecting two transputers. P outputs, and Q inputs. 
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Figure 3.10 Communication between transputers 
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Figure 3.11 

When P executes its output instruction, the registers in the link interface of the transputer executing P are 
initialised, and P is descheduled. Similarly, when Q executes its input instruction, the registers in the link 
interface of the process executing Q are initialised, and Q is descheduled (figure 3.11). 

The message is now copied through the link, after which the workspaces of P and Q are returned to the 
corresponding scheduling lists (figure 3.12). The protocol used on P and Q ensures that it does not matter 
which of P and Q first becomes ready. 
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Figure 3.12 



3.4.3 



Communication links 



A link between two transputers is implemented by connecting a link interface on one transputer to a link 
interface on the other transputer by two one-directional signal wires, along which data is transmitted serially. 
The two wires provide two OCCam channels, one in each direction. This requires a simple protocol to 
multiplex data and control information. Messages are transmitted as a sequence of bytes, each of which 
must be acknowledged before the next is transmitted. A byte of data is transmitted as a start bit followed by 
a one bit followed by eight bits of data followed by a stop bit. An acknowledgement is transmitted as a start 
bit followed by a stop bit. An acknowledgement indicates both that a process was able to receive the data 
byte and that it is able to buffer another byte. 

The protocol permits an acknowledgement to be generated as soon as the receiver has identified a data 
packet. In this way the acknowledgement can be received by the transmitter before all of the data packet has 
been transmitted and the transmitter can transmit the next data packet immediately. Some transputers do 
not implement this overlapping and achieve a data rate of 0.8 Mbytes/sec using a link to transfer data in one 
direction. However, by implementing the overlapping and including sufficient buffering in the link hardware, 
the rate can be more than doubled to achieve 1 .8 Mbytes/sec in one direction, and 2.4 Mbytes/sec when the 
link carries data in both directions. The diagram below shows the signals that would be observed on the two 
link wires when a data packet is overlapped with an acknowledgement. 











1 


1 


Data 









Data byte 








1 









Acknowledg( 


3 message 







Figure 3.13 Link data and acknowledge formats 



40 3 transputer overview 



Input 
Output 






















Link 




1 
















Link 


1 


1 


1 

< 

1 


1 1 1 1 1 

DATA 
1 1 1 1 1 


1 

1 














time 









Figure 3.14 Overlapped link acknowledge 

3.5 Timer 

The transputer has a clock which 'ticks' every microsecond. The current value of the processor clock can be 
read by executing a read f/mer instruction. 

A process can arrange to perform a timer input, in which case it will become ready to execute after a specified 
time has been reached. 

The timer input instruction requires a time to be specified. If this time is in the 'past' (i.e. ClockReg AFTER 
SpecifiedTime) then the instruction has no effect. If the time is in the 'future' (i.e. SpecifiedTime AFTER 
Clockregor SpecifiedTime = CiockReg) then the process is descheduled. When the specified time is reached 
the process is scheduled again. 

3.6 Alternative 

The Occam alternative construct enables a process to wait for input from any one of a number of channels, 
or until a specific time occurs. This requires special instructions, as the normal input instruction deschedules 
a process until a specific channel becomes ready, or until a specific time is reached. The instructions are: 

enable cfiannei disable channel 

enable timer disable timer 

alternative wait 

The alternative is implemented by 'enabling' the channel input or timer input specified in each of its compo- 
nents. The 'alternative wait' is then used to deschedule the process if none of the channel or timer inputs is 
ready; the process will be re-scheduled when any one of them becomes ready. The channel and timer inputs 
are then 'disabled'. The 'disable' instructions are also designed to select the component of the alternative to 
be executed; the first component found to be ready is executed. 

3.7 Floating point instructions 

The core of the floating point instruction set was established fairly early in the design of the floating point 
transputer. This core includes simple load, store and arithmetic instructions. Examination of statistics derived 
from FORTRAN programs suggested that the addition of some more complex instructions would improve 
performance and code density. Proposed changes to the instruction set were assesed by examining their 
effect on a number of numerical programs. For each proposed instruction set, a compiler was constructed, 
the programs compiled with it, and the resulting code then run on a simulator. The resulting instruction set is 
now described. 

In the floating point transputer operands are transferred between the transputer's memory and the floating 
point evaluation stack by means of floating point load and store instructions. There are two groups of such 
instructions, one for single length numbers, one for double length. In the description of the load and store 
instructions which follow only the double length instructions are described. However, there are single length 
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instructions which correspond with each of the double length instructions. 

The address of a floating point operand is computed on the CPU's stack and the operand is then loaded, 
from the addressed memory location, onto the FPU's stack. Operands in the floating point stack are tagged 
with their length. The operand's tag will be set when the operand is loaded or is computed. The tags allow 
the number of instructions needed for floating point operations to be reduced; there is no need, for example, 
to have both floating add single and floating add double instructions; a single floating aofd will suffice. 

3.7.1 Optimising use of the stack 

The depth of the register stacks in the CPU and FPU is carefully chosen. Floating point expressions commonly 
have embedded address calculations, as the operands of floating point operators are often elements of one 
dimensional or two dimensional arrays. The CPU stack is deep enough to allow most integer calculations 
and address calculations to be performed within it. Similarly, the depth of the FPU stack allows most floating 
point expressions to be evaluated within it, employing the CPU stack to form addresses for the operands. 

No hardware is used to deal with stack overflow. A compiler can easily examine expressions and introduce 
temporary variables in memory to avoid stack overflow. The number of such temporary variables can be 
minimised by careful choice of the evaluation order; an algorithm to perform this optimisation is given in the 
Prentice Hall publication Transputer Instruction Set - A Compiler Writers' Guide. The algorithm is used to 
optimise the use of the integer stack of the transputer CPU. 

3.7.2 Concurrent operation of FPU and CPU 

In the floating point transputer the FPU operates concurrently with the CPU. This means that it is possible to 
perform an address calculation in the CPU whilst the FPU performs a floating point calculation. This can lead 
to significant performance improvements in real applications which access arrays heavily. This aspect of the 
floating point transputer's performance was carefully assessed, partly through examination of the 'Livermore 
Loops' (refer to The Livermore Fortran Kernels: A Computer Test of the Numerical Performance Range). 
These are a collection of small kernels designed to represent the types of calculation performed on super- 
computers. They are of interest because they contain constructs which occur in real programs which are 
not represented in such programs as the Whetstone benchmark. In particular, they contain accesses to two 
and three-dimensional arrays, operations where the concurrency within the floating point transputer is used 
to good effect. In some cases the compiler is able to choose the order of performing address calculations so 
as to maximise overlapping; this involves a modification of the algorithm mentioned earlier. 

As a simple example of overlapping consider the implementation of Livermore Loop 7. The OCCam program 
for loop 7 is as follows: 

-- LIVERMORE LOOP 7 

SEQ k = FOR n 

x[k] := u[k] + ((( r*(z[k] + (r*y[k]))) + 

(t*((u[k+3] + (r*(u[k+2] + (r*u[k+l]) )))))) + 
(t*((u[k+6] + (r*(u[k+5] + (r*u[k+4]) )))))) 

The first stage in the computation of this is to load the value y[k]. This requires a sequence of four 
instructions. A further three instructions cause r to be loaded and the FPU multiply to be initiated. 

Although the floating point multiplication takes several cycles to complete, the CPU is able to continue exe- 
cuting instructions whilst the FPU performs the multiplication. Thus the CPU can execute the next segment 
of code which computes the address of z [k] whilst the FPU perfroms the multiplication. 

Finally, the value z [k] is pushed onto the floating point stack and added to the previously computed subex- 
pression r*y [k] . It is not until value z [k] is loaded that the CPU needs to synchronise with the FPU. 

The computation of the remainder of the expression proceeds in the same way, and the FPU never has to 
wait for the CPU to perform an address calculation. 
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3.8 



Floating point unit design 



In designing a concurrent systems component such as a transputer, it is important to maximise the per- 
formance obtained from a given area of silicon; many components can be used together to deliver more 
performance. This contrasts with the design of a conventional co-processor where the aim is to maximise 
the performance of a single processor by the use of a large area of silicon. As a result, in designing the 
floating point transputer, the performance benefits of silicon hungry devices such as barrel shifters and flash 
multipliers were carefully examined. 

A flash multiplier is too large to fit on chip together with the processor, and would therefore necessitate the 
use of a separate co-processor chip. The introduction of a co-processor interface to a separate chip slows 
down the rate at which operands can be transferred to and from the floating point unit. Higher performance 
can, therefore, be obtained from a slow multiplier on the same chip as the processor than from a fast one 
on a separate chip. This leads to an important conclusion: a separate co-processor chip is not appropriate 
for scalar floating point arithmetic. A separate co-processor would be effective where a large amount of 
work can be handed to the co-processor by transferring a small amount of information; for example a vector 
co-processor would require only the addresses of its vector operands to be transferred via the co-processor 
interface. 

It turns out that a flash multiplier also operates much more quickly than is necessary. Only a pipelined vector 
processor can deliver operands at a rate consistent with the use of such devices. In fact, any useful floating 
point calculation involves more operand accesses than operations. As an example consider the assignment 
y[i] := y[i] + (t * x[i]) which constitutes the core of the UNPACK floating point benchmark. 
To perform this it is necessary to load three operands, perform two operations and to store a result. If we 
assume that it takes twice as long to perform a floating point operation as to load or store a floating point 
number then the execution time of this example would be evenly split between operand access time and 
operation time. This means that there would be at most a factor of two available in performance improvement 
from the use of an infinitely fast floating point unit! 

Unlike a flash multiplier, a fast normalising shifter is important for fast floating point operation. When imple- 
menting IEEE arithmetic it may be necessary to perform a long shift on every floating point operation and 
unless a fast shifter is incorporated into the floating point unit the maximum operation time can become very 
long. Fortunately, unlike a flash multiplier, it is possible to design a fast shifter in a reasonable area of silicon. 
The shifter used is designed to perform a shift in a single cycle and to normalise in two cycles. 

Consequently, the floating point unit contains a fast normalising shifter but not a flash multiplier. However 
there is a certain amount of logic devoted to multiplication and division. Multiplication is performed three-bits 
per cycle, and division is performed two-bits per cycle. Figure 3.15 illustrates the physical layout of the floating 
point unit. 
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Figure 3.15 Floating point unit block diagram 

The datapaths contain registers and shift paths. The fraction datapath is 59 bits wide, and the exponent 
data path is 13 bits wide. The normalising shifter interfaces to both the fraction data path and the exponent 
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datapath. This is because the data to be shifted will come from the fraction datapath whilst the magnitude 
of the shift is associated with the exponent datapath. One further interesting aspect of the design is the 
microcode ROM. Although the diagram shows two ROMs, they are both part of the same logical ROM. This 
has been split in two so that control signals do not need to be bussed through the datapaths. 

3.9 Graphics capability 

The fast block move instructions of the transputers make them suitable for use in graphics applications 
using byte-per-pixel colour displays. The block move on the transputer is designed to saturate the memory 
bandwidth, moving any number of bytes from any byte boundary in memory to any other byte boundary using 
the smallest possible number of word read and write operations. 

Some transputers extend this capability by incorporation of a two-dimensional version of the block move 
(Move2d) which can move windows around a screen at full memory bandwidth, and conditional versions of 
the same block move which can be used to place templates and text into windows. One of these operations 
(Draw2d) copies bytes from source to destination, writing only non-zero bytes to the destination. A new object 
of any shape can therefore be drawn on top of the current image. A further operation (Clip2d) copies only 
zero bytes in the source. All of these instructions achieve the speed of the simple move instruction, enabling 
a 1 million pixel screen to be drawn many times per second. Unlike the conventional 'bit-bit' instruction, it is 
never necessary to read the destination data. 

3.9.1 Example - drawing coloured text 

Drawing proportional spaced text provides a simple example of the use of the two-dimensional move instruc- 
tions. The font is stored in a two dimensional array Font; the height of Font is the fixed character height, 
and the start of each character is defined by an array start. The textures of the character and its back- 
ground are selected from an array of textures; the textures providing a range of colours or even stripes and 
tartans! 

An Occam procedure to perform such drawing is given below and the effect of each stage in the drawing 
process is illustrated by the diagrams on the final page of this document. First, (1 ) the texture for the character 
is selected and copied to a temporary area and (2) the character in the font is used to clip this texture to the 
appropriate shape. Then (3) the background texture is selected and copied to the screen, and (4) the new 
character drawn on top of it. 

— Draw character ch in texture F on background texture B 
PROG DrawChar(VAL INT Ch, F, B) 
SEQ 
IF 

(x + width [ch]) > screenwidth 
SEQ 

X := 

y := y + height 
(x + width [ch]) <= screenwidth 
SKIP 
[height] [maxwidth] BYTE Temp : 
SEQ 

Move2d (Texture [F] , 0, 0, Temp, 0,0, width [ch] , height) 
Clip2d (Font [ch] , start [ch] , 0, Temp, 0,0, width [ch] , height) 
Move2d (Texture [B] , 0, 0, Screen, x,y, width [ch] , height) ) 
Draw2d(Temp, 0, 0, Screen, x,y, width [ch] , height) 
X := X + width [ch] 

This procedure will fill a 1 million pixel screen with proportionally spaced characters in about 1/6 second. 
Obviously, a simpler and faster version could be used if the character colour or background colour was 
restricted. The operation of this procedure is illustrated in figure 3.16. 
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Figure 3.16 Use of enhanced graphics instructions 
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4 Conclusion 

The INMOS transputer family is a range of system components which can be used to construct high per- 
formance concurrent systems. As all members of the family incorporate INMOS communications links, a 
system may be constructed from different members of the family. All transputers provide hardware support 
for concurrency and offer exceptional performance on process scheduling, inter-process communication and 
inter-transputer communication. 

The design of the transputers takes careful note of silicon economics. The central processor used in the 
transputer offers a performance comparable with that of other VLSI processors several times larger. The 
small size of the processor allows a memory and communications system to be integrated on to the same 
VLSI device. This level of integration allows very fast access to memory and very fast inter-transputer 
communication. Similarly, the transputer floating point unit is integrated into the same device as the central 
processor, eliminating the delays inherent in communicating data between devices. 
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1 Introduction 

The IMS T805 transputer is a 32 bit CMOS microcomputer with a 64 bit floating point unit and graphics support. 
It has 4 Kbytes on-chip RAM for high speed processing, a configurable memory interface and four standard 
INMOS communication links. The instruction set achieves efficient implementation of high level languages 
and provides direct support for the OCCam model of concurrency when using either a single transputer or a 
network. Procedure calls, process switching and typical interrupt latency are sub-microsecond. 

For convenience of description, the IMS T805 operation is split into the basic blocks shown in figure 1.1. 
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Figure 1.1 IMS T805 block diagram 



The processor speed of a device can be pin-selected in stages from 17.5 MHz up to the maximum allowed 
for the part. A device running at 30 MHz achieves an instruction throughput of 30 MIPS peak and 15 MIPS 
sustained. The extended temperature version of the device complies with MIL-STD-883C. 

The IMS T805 provides high performance arithmetic and floating point operations. The 64 bit floating point unit 
provides single and double length operation to the ANSI-IEEE 754-1985 standard for floating point arithmetic. 
It is able to perform floating point operations concurrently with the processor, sustaining a rate of 2.2 Mflops 
at a processor speed of 20 MHz and 3.3 Mflops at 30 MHz. 
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High performance graphics support is provided by microcoded block move instructions which operate at the 
speed of memory. The two-dimensional block move instructions provide for contiguous block moves as well 
as block copying of either non-zero bytes of data only or zero bytes only. Block move instructions can be used 
to provide graphics operations such as text manipulation, windowing, panning, scrolling and screen updating. 

Cyclic redundancy checking (CRC) instructions are available for use on arbitrary length serial data streams, 
to provide error detection where data integrity is critical. Another feature of the IMS T805, useful for pattern 
recognition, is the facility to count bits set in a word. 

The IMS T805 can directly access a linear address space of 4 Gbytes. The 32 bit wide memory interface 
uses multiplexed data and address lines and provides a data rate of up to 4 bytes every 100 nanoseconds 
(40 Mbytes/sec) for a 30 MHz device. A configurable memory controller provides all timing, control and DRAM 
refresh signals for a wide variety of mixed memory systems. 

System Services include processor reset and bootstrap control, together with facilities for error analysis. Error 
signals may be daisy-chained in multi-transputer systems. 

The standard INMOS communication links allow networks of transputer family products to be constructed by 
direct point to point connections with no external logic. The IMS T805 links support the standard operating 
speed of 1 Mbits/sec, but also operate at 5 or 20 Mbits/sec. Each link can transfer data bi-directionally at 
up to 2.35 Mbytes/sec. 

The IMS T805 is pin compatible with the IMS T800, as the extra inputs used are all held to ground on the 
IMS T800. The IMS T805-20 can thus be plugged directly into a circuit designed for a 20 MHz version of the 
IMS T800. 

The transputer is designed to implement the Occam language, detailed in the Occam Reference Manual, but 
also efficiently supports other languages such as C, Pascal and Fortran. Access to the transputer at machine 
level is seldom required, but if necessary refer to the Transputer Instruction Set - A Compiler Writers' Guide. 
The instruction set of the IMS T805 is the same as that of the IMS T800. 

This data sheet supplies hardware implementation and characterisation details for the IMS T805. It is intended 
to be read in conjunction with the Transputer Architecture chapter, which details the architecture of the 
transputer and gives an overview of Occam. 

The IMS T805 instruction set contains a number of instructions to facilitate the implementation of breakpoints. 
For further information concerning breakpointing, refer to Support for debugging/breakpointing in transputers 
(technical note 61). 

Figure 1.2 shows the internal datapaths for the IMS T805. 
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Figure 1.2 IMS T805 internal datapatlis 
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P;n designations 



Table 2.1 IMS T805 system services 



Pin 


In/Out 


Function 


VCC, GND 




Power supply and return 


CapPlus, CapMinus 




External capacitor for internal clock power supply 


Clockln 


in 


Input clock 


ProcSpeedSelectO-2 


in 


Processor speed selectors 


Reset 


in 


System reset 


Error 


out 


Error indicator 


Errorin 


in 


Error daisychain input 


Analyse 


in 


Error analysis 


BootFromRom 


in 


Boot from external ROM or from link 


DIsablelntRAM 


in 


Disable internal RAM 



Table 2.2 IMS T805 external memory interface 



Pin 


In/Out 


Function 


ProcClockOut 


out 


Processor clock 


Mem not Wr DO 


in/out 


Multiplexed data bit and write cycle warning 


MemnotRfDI 


in/out 


Multiplexed data bit 1 and refresh warning 


MemAD2-31 


in/out 


Multiplexed data and address bus 


notMemRd 


out 


Read strobe 


notMemWrBO-3 


out 


Four byte-addressing write strobes 


notMemSO-4 


out 


Five general purpose strobes 


notMemRf 


out 


Dynamic memory refresh indicator 


RefreshPendIng 


out 


Dynamic refresh is pending 


MemWalt 


in 


Memory cycle extender 


MemReq 


in 


Direct memory access request 


MemG ranted 


out 


Direct memory access granted 


MemConfig 


in 


Memory configuration data input 



Table 2.3 IMS T805 event 



Pin 


In/Out 


Function 


EventReq 
EventAck 
Event Waiting 


in 

out 
out 


Event request 

Event request acknowledge 

Event input requested by software 



Table 2.4 IMS T805 link 



Pin 


In/Out 


Function 


LlnkInO-3 


in 


Four serial data input channels 


LinkOutO-3 


out 


Four serial data output channels 


LlnkSpecial 


in 


Select non-standard speed as 5 or 20 Mbits/sec 


LInkOSpecial 


in 


Select special speed for Link 


Link123Special 


in 


Select special speed for Links 1 ,2,3 



Signal names are prefixed by not if they are active low, otherwise they are active high. 
Pinout details for various packages are given on page 120. 
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3 Processor 

The 32 bit processor contains instruction processing logic, instruction and work pointers, and an operand 
register. It directly accesses the high speed 4 Kbyte on-chip memory, which can store data or program. 
Where larger amounts of memory or programs in ROM are required, the processor has access to 4 Gbytes 
of memory via the External Memory Interface (EMI). 



3.1 



Registers 



The design of the transputer processor exploits the availability of fast on-chip memory by having only a small 
number of registers; six registers are used in the execution of a sequential process. The small number of 
registers, together with the simplicity of the instruction set, enables the processor to have relatively simple 
(and fast) data-paths and control logic. The six registers are: 

The workspace pointer which points to an area of store where local variables are kept. 

The instruction pointer which points to the next instruction to be executed. 

The operand register which is used in the formation of instruction operands. 

The A, Band C registers which form an evaluation stack. 

A, B and C are sources and destinations for most arithmetic and logical operations. Loading a value into the 
stack pushes B into C, and A into B, before loading A. Storing a value from A, pops B into A and C into B. 

Expressions are evaluated on the evaluation stack, and instructions refer to the stack implicitly. For example, 
the add instruction adds the top two values in the stack and places the result on the top of the stack. The use of 
a stack removes the need for instructions to respecify the location of their operands. Statistics gathered from a 
large number of programs show that three registers provide an effective balance between code compactness 
and implementation complexity. 

No hardware mechanism is provided to detect that more than three values have been loaded onto the stack. 
It is easy for the compiler to ensure that this never happens. 

Any location in memory can be accessed relative to the workpointer register, enabling the workspace to be 
of any size. 

Further register details are given in Transputer Instruction Set - A Compiler Writers' Guide. 
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Figure 3.1 Registers 
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3.2 



Instructions 



The instruction set has been designed for simple and efficient compilation of high-level languages. All in- 
structions have the same format, designed to give a compact representation of the operations occurring most 
frequently in programs. 

Each instruction consists of a single byte divided into two 4-bit parts. The four most significant bits of the byte 
are a function code and the four least significant bits are a data value. 



Function 


Data 1 


7 4 3 

1' 


Operand Register 





Figure 3.2 Instruction format 



3.2.1 



Direct functions 



The representation provides for sixteen functions, each with a data value ranging from to 15. Ten of these, 
shown in table 3.1, are used to encode the most important functions. 

Table 3.1 Direct functions 



load constant 


add constant 




load local 


store local 


load local pointer 


load non-local 


store non-local 




jump 


conditional jump 


call 



The most common operations in a program are the loading of small literal values and the loading and storing 
of one of a small number of variables. The load constant instruction enables values between and 15 to be 
loaded with a single byte instruction. The load local an6 store /oca/ instructions access locations in memory 
relative to the workspace pointer. The first 16 locations can be accessed using a single byte instruction. 

The load non-local and store non-local instructions behave similarly, except that they access locations in 
memory relative to the A register. Compact sequences of these instructions allow efficient access to data 
structures, and provide for simple implementations of the static links or displays used in the implementation 
of high level programming languages such as OCCam, C, Fortran, Pascal or ADA. 



3.2.2 



Prefix functions 



Two more function codes allow the operand of any instruction to be extended in length; prefix and negative 
prefix. 

All instructions are executed by loading the four data bits into the least significant four bits of the operand 
register, which is then used as the instruction's operand. All instructions except the prefix instructions end by 
clearing the operand register, ready for the next instruction. 

The prefix instruction loads its four data bits into the operand register and then shifts the operand register up 
four places. The negative prefix instruction is similar, except that it complements the operand register before 
shifting it up. Consequently operands can be extended to any length up to the length of the operand register 
by a sequence of prefix instructions. In particular, operands in the range -256 to 255 can be represented 
using one prefix instruction. 
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The use of prefix instructions has certain beneficial consequences. Firstly, they are decoded and executed 
in the same way as every other instruction, which simplifies and speeds instruction decoding. Secondly, they 
simplify language compilation by providing a completely uniform way of allowing any instruction to take an 
operand of any size. Thirdly, they allow operands to be represented in a form independent of the processor 
wordlength. 



3.2.3 



Indirect functions 



The remaining function code, operate, causes its operand to be interpreted as an operation on the values 
held in the evaluation stack. This allows up to 16 such operations to be encoded in a single byte instruction. 
However, the prefix instructions can be used to extend the operand of an operate instruction just like any 
other. The instruction representation therefore provides for an indefinite number of operations. 

Encoding of the indirect functions is chosen so that the most frequently occurring operations are represented 
without the use of a prefix instruction. These include arithmetic, logical and comparison operations such as 
add, exclusive or and greater than. Less frequently occurring operations have encodings which require a 
single prefix operation. 

3.2.4 Expression evaluation 

Evaluation of expressions sometimes requires use of temporary variables in the workspace, but the number 
of these can be minimised by careful choice of the evaluation order. 

Table 3.2 Expression evaluation 



Program 


Mnemonic 


X :=0 


Idc 

StI X 


X := #24 


pfix 2 
Idc 4 

StI X 


X := y + z 


Idl y 
Idl z 
add 

StI X 



3.2.5 Efficiency of encoding 

Measurements show that about 70% of executed instructions are encoded in a single byte; that is, without 
the use of prefix instructions. Many of these instructions, such as load constant and add require just one 
processor cycle. 

The instruction representation gives a more compact representation of high level language programs than 
more conventional instruction sets. Since a program requires less store to represent it, less of the memory 
bandwidth is taken up with fetching instructions. Furthermore, as memory is word accessed the processor 
will receive four instructions for every fetch. 

Short instructions also improve the effectiveness of instruction pre-fetch, which in turn improves processor 
performance. There is an extra word of pre-fetch buffer, so the processor rarely has to wait for an instruction 
fetch before proceeding. Since the buffer is short, there is little time penalty when a jump instruction causes 
the buffer contents to be discarded. 
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3.3 



Processes and concurrency 



A process starts, performs a number of actions, and then either stops without completing or terminates 
complete. Typically, a process is a sequence of instructions. A transputer can run several processes in 
parallel (concurrently). Processes may be assigned either high or low priority, and there may be any number 
of each (page 56). 

The processor has a microcoded scheduler which enables any number of concurrent processes to be exe- 
cuted together, sharing the processor time. This removes the need for a software kernel. 



At any time, a concurrent process may be 

Active 



Inactive 



Being executed. 

On a list waiting to be executed. 

Ready to input. 

Ready to output. 

Waiting until a specified time. 



The scheduler operates in such a way that inactive processes do not consume any processor time. It allocates 
a portion of the processor's time to each process in turn. Active processes waiting to be executed are held 
in two linked lists of process workspaces, one of high priority processes and one of low priority processes 
(page 56). Each list is implemented using two registers, one of which points to the first process in the list, 
the other to the last. In the Linked Process List figure 3.3, process S is executing and P, Q and R are active, 
awaiting execution. Only the low priority process queue registers are shown; the high priority process ones 
perform in a similar manner. 





Registers 






Locals 






Program 






FPtn (Front) 


>. 


P 


>- 






BPtn (Back) 




1 










Q 




>^ 














' 


^ 


A 




1 






R 










B 




-^ 










C 


S 










Workspace 






Next Inst 










'^ 


Operand 

























Figure 3.3 Linked process list 



Table 3.3 Priority queue control registers 



Function 



Pointer to front of active process list 
Pointer to back of active process list 



High Priority 



Low Priority 



FptrO 
BptrO 



Fptn 
Bptn 



Each process runs until it has completed its action, but is descheduled whilst waiting for communication from 
another process or transputer, or for a time delay to complete. In order for several processes to operate in 
parallel, a low priority process is only permitted to run for a maximum of two time slices before it is forcibly 
descheduled at the next descheduling point (page 60). The time slice period is 5120 cycles of the external 
5 MHz clock, giving ticks approximately 1 ms apart. 
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A process can only be descheduled on certain instructions, known as descheduling points (page 60). As a 
result, an expression evaluation can be guaranteed to execute without the process being timesliced partway 
through. 

Whenever a process is unable to proceed, its instruction pointer is saved in the process workspace and 
the next process taken from the list. Process scheduling pointers are updated by instructions which cause 
scheduling operations, and should not be altered directly. Actual process switch times are less than 1 fis, as 
little state needs to be saved and it is not necessary to save the evaluation stack on rescheduling. 

The processor provides a number of special operations to support the process model, including start process 
and end process. When a main process executes a parallel construct, start process instructions are used 
to create the necessary additional concurrent processes. A start process instruction creates a new process 
by adding a new workspace to the end of the scheduling list, enabling the new concurrent process to be 
executed together with the ones already being executed. When a process is made active it is always added 
to the end of the list, and thus cannot pre-empt processes already on the same list. 

The correct termination of a parallel construct is assured by use of the end process instruction. This uses 
a workspace location as a counter of the parallel construct components which have still to terminate. The 
counter is initialised to the number of components before the processes are started. Each component ends 
with an end process instruction which decrements and tests the counter. For all but the last component, the 
counter is non zero and the component is descheduled. For the last component, the counter is zero and the 
main process continues. 

3.4 Priority 

The IMS T805 supports two levels of priority. Priority 1 (low priority) processes are executed whenever there 
are no active priority (high priority) processes. 

High priority processes are expected to execute for a short time. If one or more high priority processes are 
able to proceed, then one is selected and runs until it has to wait for a communication, a timer input, or until 
it completes processing. 

If no process at high priority is able to proceed, but one or more processes at low priority are able to proceed, 
then one is selected. 

Low priority processes are periodically timesliced to provide an even distribution of processor time between 
computationally intensive tasks. 

If there are n low priority processes, then the maximum latency from the time at which a low priority process 
becomes active to the time when it starts processing is 2n-2 timeslice periods. It is then able to execute for 
between one and two timeslice periods, less any time taken by high priority processes. This assumes that 
no process monopolises the transputer's time; i.e. it has a distribution of descheduling points (page 60). 

Each timeslice period lasts for 5120 cycles of the external 5 MHz input clock (approximately 1 ms at the 
standard frequency of 5 MHz). 

If a high priority process is waiting for an external channel to become ready, and if no other high priority 
process is active, then the interrupt latency (from when the channel becomes ready to when the process 
starts executing) is typically 19 processor cycles, a maximum of 78 cycles (assuming use of on-chip RAM). 
If the floating point unit is not being used at the time then the maximum interrupt latency is only 58 cycles. 
To ensure this latency, certain instructions are interruptable. 

3.5 Communications 

Communication between processes is achieved by means of channels. Process communication is point-to- 
point, synchronised and unbuffered. As a result, a channel needs no process queue, no message queue and 
no message buffer. 
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A channel between two processes executing on the same transputer is implemented by a single word in 
memory; a channel between processes executing on different transputers is implemented by point-to-point 
links. The processor provides a number of operations to support message passing, the most important being 
input message and output message. 

The input message and output message instructions use the address of the channel to determine whether 
the channel is internal or external. Thus the same instruction sequence can be used for both, allowing a 
process to be written and compiled without knowledge of where its channels are connected. 

The process which first becomes ready must wait until the second one is also ready. A process performs an 
input or output by loading the evaluation stack with a pointer to a message, the address of a channel, and 
a count of the number of bytes to be transferred, and then executing an input message or output message 
instruction. Data is transferred if the other process is ready. If the channel is not ready or is an external one 
the process will deschedule. 

3.6 Block move 

The block move on the transputer moves any number of bytes from any byte boundary in memory, to any 
other byte boundary, using the smallest possible number of word read, and word or part-word writes. 

A block move instruction can be interrupted by a high priority process. On interrupt, block move is completed 
to a word boundary, independent of start position. When restarting after interrupt, the last word written is 
written again. This appears as an unnecessary read and write in the simplest case of word aligned block 
moves, and may cause problems with FIFOs. This problem can be overcome by incrementing the saved 
destination (BreglntSaveLoc) and source pointer (CreglntSaveLoc) values by BytesPerWord during the high 
priority process. 

3.7 Timers 

The transputer has two 32 bit timer clocks which 'tick' periodically. The timers provide accurate process 
timing, allowing processes to deschedule themselves until a specific time. 

One timer is accessible only to high priority processes and is incremented every microsecond, cycling com- 
pletely in approximately 4295 seconds. The other is accessible only to low priority processes and is incre- 
mented every 64 microseconds, giving exactly 15625 ticks in one second. It has a full period of approximately 
76 hours. 

Table 3.4 Timer registers 



ClockO 
Clockl 
TNextRegO 
TNextRegl 



Current value of high priority (level 0) process clock 

Current value of low priority (level 1) process clock 

Indicates time of earliest event on high priority (level 0) timer queue 

Indicates time of earliest event on low priority (level 1) timer queue 



The current value of the processor clock can be read by executing a load timer instruction. A process can 
arrange to perform a timer input, in which case it will become ready to execute after a specified time has 
been reached. The //mer //ipuf instruction requires a time to be specified. If this time is in the 'past' then the 
instruction has no effect. If the time is in the 'future' then the process is descheduled. When the specified 
time is reached the process is scheduled again. 
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Figure 3.4 shows two processes waiting on the timer queue, one waiting for time 21 , the other for time 31 . 
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Figure 3.4 Timer registers 
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4 Instruction set summary 

The Function Codes table 4.8 gives the basic function code set (page 53). Where the operand is less than 16, 
a single byte encodes the complete instruction. If the operand is greater than 15, one prefix instruction (pfix) 
is required for each additional four bits of the operand. If the operand is negative the first prefix instruction 
will be nfix. 

Table 4'.1 prefix cooing 









Function 


Memory 


Mnemonic 


code 


code 


Idc 




#3 


#4 


#43 


Idc 




#35 






is coded 


as 








pfix 




#3 


#2 


#23 


Idc 




#5 


#4 


#45 


Idc 




#987 






is coded 


as 








pfix 




#9 


#2 


#29 


pfix 




#8 


#2 


#28 


Idc 




#7 


#4 


#47 


Idc 




-31 (Idc 


#FFFFFFE1) 




is coded 


as 








nfix 




#1 


#6 


#61 


Idc 




#1 


#4 


#41 



Tables 4.9 to 4.28 give details of the operation codes. Where an operation code is less than 16 (e.g. add: 
operation code 05), the operation can be stored as a single byte comprising the operate function code F and 
the operand (5 in the example). Where an operation code is greater than 15 (e.g. ladd: operation code 16), 
the prefix function code 2 is used to extend the instruction. 



Table 4.2 operate coding 



Mnemonic 



Function 
code 



Memory 
code 



add 
is coded as 

opr 

ladd 
is coded as 

pfix 
opr 



(op. code #5) 

add 

(op. code #1 6) 

#1 
#6 



#F 



#2 
#F 



#F5 

#F5 

#21 F6 

#21 
#F6 



The load device identity (Iddevid) instruction (table 4.20) pushes the device type identity into the A register. 
Each product is allocated a unique group of numbers for use with the /dcfewc/ instruction. The product identity 
numbers for the IMS T805 are 10 to 19 inclusive. 



In the Floating Point Operation Codes tables 4.22 to 4.28, a selector sequence code (page 69) is indicated 
in the Memory Code column by s. The code given in the Operation Code column is the indirection code, the 
operand for the Idc instruction. 

The FPU and processor operate concurrently, so the actual throughput of floating point instructions is better 
than that implied by simply adding up the instruction times. For full details see Transputer Instruction Set - A 
Compiler Writers' Guide. 
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The Processor Cycles column refers to the number of periods TPCLPCL taken by an instruction executing 
in internal memory. The number of cycles is given for the basic operation only; where the memory code 
for an instruction is two bytes, the time for the prefix function (one cycle) should be added. For a 20 MHz 
transputer one cycle is 50 ns. Some instruction times vary. Where a letter is included in the cycles column it 
is interpreted from table 4.3. 



Table 4.3 Instruction set interpretation 



Ident 



Interpretation 



b 

m 

n 

w 

P 
r 



Bit number of the highest bit set in register A. Bit is the least significant bit. 

Bit number of the highest bit set in the absolute value of register A. 
Bit is the least significant bit. 

Number of places shifted. 

Number of words in the message. Part words are counted as full words. If the message 
is not word aligned the number of words is increased to include the part words at either 
end of the message. 

Number of words per row. 

Number of rows. 



The DE column of the tables indicates the descheduling/error features of an instruction as described in 
table 4.4. 

Table 4.4 Instruction features 



Ident 


Feature 


See page: 


D 
E 
F 


The instruction is a descheduling point 
The instruction will affect the Error flag 
The instruction will affect the FP_Error flag 


60 
61,76 
69, 61 



4.1 



Descheduling points 



The instructions in table 4.5 are the only ones at which a process may be descheduled (page 55). They are 
also the ones at which the processor will halt if the Analyse pin is asserted (page 75). 



Table 4.5 Descheduling point instructions 



input message 
timer alt wait 
jump 



output message 
timer input 
loop end 



output byte 
stop on error 
end process 



output word 
alt wait 
stop process 
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4.2 



Error instructions 



The instructions in table 4.6 are tine only ones which can affect the Error flag (page 76) directly. Note, 
however, that the floating point unit error flag FP.Error'is set by certain floating point instructions (page 61), 
and that Error can be set from this flag by fpcheckerron 

Table 4.6 Error setting instructions 



add 


add constant 


subtract 




multiply 


fractional multiply 


divide 


remainder 


long add 


long subtract 


long divide 




set error 


testerr 


fpcheckerror 




check word 


check subscript from 


check single 


check count from 1 



4.3 



Debugging support 



Table 4.21 contains a number of instructions to facilitate the implementation of breakpoints. These instructions 
overload the operation of y'O. Normally yO is a no-op which might cause descheduling. SetjObreak enables the 
breakpointing facilities and causes jO to act as a breakpointing instruction. When breakpointing is enabled, 
jO swaps the current Iptr and Wptr with an Iptr and Wptr stored above MemStart. The breakpoint instruction 
does not cause descheduling, and preserves the state of the registers. It is possible to single step the pro- 
cessor at machine level using these instructions. Refer to Support for debugging/breakpointing in transputers 
(technical note 61) for more detailed information regarding debugger support. 



4.4 



Floating point errors 



The instructions in table 4.7 are the only ones which can affect the floating point error flag FP.Error {page 69). 
Error is set from this flag by fpcheckerror if FP.Error is set. 



Table 4.7 Floating point error setting instructions 



fpadd 


fpsub 


fpmul 


fpdiv 


fpldnladdsn 


fpldnladddb 


fpldnlmulsn 


fpldnlmuldb 


fpremfirst 


fpusqrtfirst 


fpgt 


fpeq 


fpuseterror 


fpuclearerror 


fptesterror 




fpuexpincby32 


fpuexpdecby32 


fpumulby2 


fpudivby2 


fpur32tor64 


fpur64tor32 


fpucki32 


fpucki64 


fprtoi32 


fpuabs 


fpint 
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Table 4.8 IMS T805 function codes 



Function 


IVIemory 




Processor 




D 


Code 


Code 


IVInemonic 


Cycles 


Name 


E 





OX 


J 


3 


jump 


D 


1 


1X 


Idip 


1 


load local pointer 




2 


2X 


pfix 


1 


prefix 




3 


3X 


Idnl 


2 


load non-local 




4 


4X 


Idc 


1 


load constant 




5 


5X 


Idnip 


1 


load non-local pointer 




6 


6X 


nfix 


1 


negative prefix 




7 


7X 


Idl 


2 


load local 




8 


8X 


adc 


1 


add constant 


E 


9 


9X 


call 


7 


call 




A 


AX 


cj 


2 

4 


conditional jump (not taken) 
conditional jump (taken) 




B 


BX 


ajw 


1 


adjust workspace 




C 


OX 


eqc 


2 


equals constant 




D 


DX 


sti 


1 


store local 




E 


EX 


stnl 


2 


store non-local 




F 


FX 


opr 


- 


operate 





Table 4.9 IMS T805 arithmetic/logical operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


46 


24F6 


and 


1 


and 




4B 


24FB 


or 


1 


or 




33 


23F3 


xor 


1 


exclusive or 




32 


23F2 


not 


1 


bitwise not 




41 


24F1 


shI 


n+2 


shift left 




40 


24F0 


shr 


n+2 


shift right 




05 


F5 


add 


1 


add 


E 


OC 


FC 


sub 


1 


subtract 


E 


53 


25F3 


mul 


38 


multiply 


E 


72 


27F2 


fmul 


35 


fractional multiply (no rounding) 


E 








40 


fractional multiply (rounding) 


E 


2C 


22FC 


div 


39 


divide 


E 


1F 


21 FF 


rem 


37 


remainder 


E 


09 


F9 


gt 


2 


greater than 




04 


F4 


diff 


1 


difference 




52 


25F2 


sum 


1 


sum 




08 


F8 


prod 


b+4 
m+5 


product for positive register A 
product for negative register A 
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Table 4.10 IMS T805 long arithmetic operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


16 


21 F6 


ladd 


2 


long add 


E 


38 


23 F8 


Isub 


2 


long subtract 


E 


37 


23 F7 


Isum 


3 


'ong sum 




4F 


24FF 


Idiff 


3 


long diff 




31 


23F1 


Imul 


33 


long multiply 




1A 


21 FA 


Idiv 


35 


long divide 


E 


36 


23F6 


IshI 


n+3 
n-28 


long shift left (n<32) 
long shift left(n>32) 




35 


23F5 


Ishr 


n+3 
n-28 


long shift right (n<32) 
long shift right (n>32) 




19 


21 F9 


norm 


n+5 
n-26 

3 


normalise (n<32) 
normalise (n>32) 
normalise (n=64) 





Table 4.11 IMS T805 general operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


00 


FO 


rev 


1 


reverse 




3A 


23FA 


xword 


4 


extend to word 




56 


25F6 


cword 


5 


check word 


E 


ID 


21 FD 


xdble 


2 


extend to double 




4C 


24FC 


csngi 


3 


check single 


E 


42 


24F2 


mint 


1 


minimum integer 




5A 


25FA 


dup 


1 


duplicate top of stack 




79 


27F9 


pop 


1 


pop processor stack 





Table 4.12 IMS T805 2D block move operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


5B 


25FB 


move2dinit 


8 


initialise data for 2D block move 




5C 


25FC 


move2dall 


(2p+23)*r 


2D block copy 




5D 


25FD 


move2dnonzero 


(2p+23)*r 


2D block copy non-zero bytes 




5E 


25FE 


move2dzero 


(2p+23)*r 


2D block copy zero bytes 





Table 4.13 IMS T805 ORG and bit operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


74 
75 

76 
77 
78 


27F4 
27F5 

27F6 
27F7 
27F8 


crcword 
crcbyte 

bitcnt 

bitrevword 

bitrevnbits 


35 

11 

b+2 

36 

n+4 


calculate crc on word 
calculate crc on byte 

count bits set in word 
reverse bits in word 
reverse bottom n bits in word 
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Table 4.14 IMS T805 indexing/array operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


02 


F2 


bsub 


1 


byte subscript 




OA 


FA 


wsub 


2 


word subscript 




81 


28F1 


wsubdb 


3 


form double word subscript 




34 


23F4 


bent 


2 


byte count 




3F 


23FF 


went 


5 


word count 




01 


F1 


lb 


5 


load byte 




3B 


23FB 


sb 


4 


store byte 




4A 


24FA 


move 


2w+8 


move message 





Table 4.15 IMS T805 timer handling operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


22 


22F2 


Idtimer 


2 


load timer 




2B 


22FB 


tin 


30 


timer input (time future) 


D 








4 


timer input (time past) 


D 


4E 


24FE 


talt 


4 


timer alt start 




51 


25F1 


taltwt 


15 


timer alt wait (time past) 


D 








48 


timer alt wait (time future) 


D 


47 


24F7 


enbt 


8 


enable timer 




2E 


22FE 


dist 


23 


disable timer 





Table 4.16 IMS T805 input/output operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


07 
OB 
OF 
OE 


F7 
FB 
FF 
FE 


in 

out 

outword 
outbyte 


2W+19 
2W+19 

23 

23 


input message 
output message 
output word 
output byte 


D 
D 
D 
D 


43 

44 

45 


24F3 
24F4 

24F5 


alt 
altwt 

altend 


2 
5 

17 
4 


alt start 

alt wait (channel ready) 
alt wait (channel not ready) 
alt end 


D 
D 


49 
30 


24F9 
23F0 


enbs 
diss 


3 

4 


enable skip 
disable skip 




12 
48 

2F 


21 F2 
24F8 

22FF 


resetch 
enbe 

disc 


3 
7 
5 
8 


reset channel 
enable channel (ready) 
enable channel (not ready) 
disable channel 
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Table 4.17 IMS T805 control operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


20 


22F0 


ret 


5 


return 




IB 


21 FB 


Idpi 


2 


load pointer to instruction 




30 


23FC 


gajw 


2 


general adjust workspace 




06 


F6 


gcall 


4 


general call 




21 


22F1 


lend 


10 


loop end (loop) 


D 








5 


loop end (exit) 


D 



Table 4.18 IMS T805 scheduling operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


OD 


FD 


startp 


12 


start process 


D 


03 


F3 


endp 


13 


end process 


D 


39 


23F9 


runp 


10 


run process 




15 


21 F5 


stopp 


11 


stop process 




IE 


21 FE 


Idpri 


1 


load current priority 





Table 4.19 IMS T805 error handling operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


13 
4D 


21 F3 
24FD 


csubO 
ccnti 


2 
3 


check subscript from 
check count from 1 


E 
E 


29 

10 


22F9 
21 FO 


testerr 
seterr 


2 
3 

1 


test error false and clear (no error) 
test error false and clear (error) 
set error 


E 


55 
57 


25F5 
25F7 


stoperr 
cirhalterr 


2 

1 


stop on error (no error) 
clear halt-on-error 


D 


58 


25F8 


sethalterr 


1 


set halt-on-error 




59 


25F9 


testhalterr 


2 


test halt-on-error 





Table 4.20 IMS T805 processor initialisation operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


2A 


22FA 


testpranal 


2 


test processor analysing 




3E 


23FE 


saveh 


4 


save high priority queue registers 




3D 


23FD 


savel 


4 


save low priority queue registers 




18 


21 F8 


sthf 




store high priority front pointer 




50 


25F0 


sthb 




store high priority back pointer 




1C 


21 FC 


stif 




store low priority front pointer 




17 


21 F7 


stib 




store low priority back pointer 




54 


25F4 


sttimer 




store timer 




17C 


2127FC 


Iddevid 




load device identity 




7E 


27FE 


Idmemstartval 




load value of memstart address 
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Table 4.21 IMS T805 debugger support codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 





00 


jump 


3 

11 
13 


jump (break not enabled) 

jump (break enabled, high priority) 

jump (break enabled, low priority) 


D 


B1 


2BF1 


break 


9 

11 


break (high priority) 
break (low priority) 




B2 


2BF2 


cIrjObreak 


1 


clear jump break enable flag 




83 


28F3 


setjObreak 


1 


set jump break enable flag 




84 


2BF4 


testjObreak 


2 


test jump break enable flag set 




7k 


27FA 


timerdisableh 


1 


disable high priority timer interrupt 




78 


27FB 


timerdisablel 


1 


disable low priority timer interrupt 




70 


27FC 


timerenableh 


6 


enable high priority timer interrupt 




7D 


27FD 


timerenablel 


6 


enable low priority timer interrupt 





Table 4.22 IMS T805 floating point load/store operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


BE 


28FE 


fpldnlsn 


2 


fp load non-local single 




8A 


28FA 


fpldnldb 


3 


fp load non-local double 




86 


28F6 


fpldnlsni 


4 


fp load non-local indexed single 




82 


28F2 


fpldnldbi 


6 


fp load non-local indexed double 




9F 


29FF 


fpldzerosn 


2 


load zero single 




AO 


2AF0 


fpldzerodb 


2 


load zero double 




AA 


2AFA 


fpldnladdsn 


8/11 


fp load non local & add single 


F 


A6 


2AF6 


fpldnladddb 


9/12 


fp load non local & add double 


F 


AC 


2AFC 


fpldnlmulsn 


13/20 


fp load non local & multiply single 


F 


A8 


2AF8 


fpldnlmuldb 


21/30 


fp load non local & multiply double 


F 


88 


28F8 


fpstnlsn 


2 


fp store non-local single 




84 


28F4 


fpstnldb 


3 


fp store non-local double 




9E 


29FE 


fpstnli32 


4 


store non-local int32 





Processor cycles are shown as Typical/Maximum cycles. 



Table 4.23 IMS T805 floating point general operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


A8 
A4 
A3 


2AF8 
2AF4 
2AF3 


fpentry 

fprev 

fpdup 


1 
1 

1 


floating point unit entry 
fp reverse 
fp duplicate 
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Table 4.24 IMS T805 floating point rounding operation codes 



Operation 


l\/lemory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


22 


s 


fpurn 


1 


set rounding mode to round nearest 




06 


s 


fpurz 


1 


set rounding mode to round zero 




04 


s 


fpurp 


1 


set rounding mode to round positive 




05 


s 


fpurm 


1 


set rounding mode to round minus 





Table 4.25 IMS T805 floating point error operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


83 
90 
23 
90 


28 F3 
29F0 

s 

s 


fpchkerror 
fptesterror 
fpuseterror 
fpuclearerror 


1 
2 

1 
1 


check fp error 

test fp error false and clear 

set fp error 

clear fp error 


E 
F 
F 
F 



Table 4.26 IMS T805 floating point comparison operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


94 


29 F4 


fpgt 


4/6 


fp greater than 


F 


95 


29F5 


fpeq 


3/5 


fp equality 


F 


92 


29F2 


fpordered 


3/4 


fp orderability 




91 


29F1 


fpnan 


2/3 


fpNaN 




93 


29F3 


fpnotfinite 


2/2 


fp not finite 




OE 


s 


fpuchki32 


3/4 


check in range of type int32 


F 


OF 


s 


fpuchki64 


3/4 


check in range of type int64 


F 



Processor cycles are shown as Typical/Maximum cycles. 



Table 4.27 IMS T805 floating point conversion operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


07 


s 


fpur32tor64 


3/4 


real32 to real64 


F 


08 


s 


fpur64tor32 


6/9 


real64 to real32 


F 


9D 


29FD 


fprtoi32 


7/9 


real to int32 


F 


96 


29F6 


fpi32tor32 


8/10 


int32 to real32 


' 


98 


29F8 


fpi32tor64 


8/10 


int32 to real64 




9A 


29FA 


fpb32tor64 


8/8 


bit32 to real64 




OD 


s 


fpunoround 


2/2 


real64 to real32, no round 




A1 


2AF1 


fpint 


5/6 


round to floating integer 


F 



Processor cycles are shown as Typical/Maximum cycles. 
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Table 4.28 IMS T805 floating point arithmetic operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor cycles 


Name 


D 

E 


Single 


Double 


87 


28F7 


fpadd 


6/9 


6/9 


fp add 


F 


89 


28F9 


fpsub 


6/9 


6/9 


fp subtract 


F 


8B 


28FB 


fpmul 


11/18 


18/27 


fp multiply 


F 


8C 


28FC 


fpdiv 


16/28 


31/43 


fp divide 


F 


OB 


s 


fpuabs 


2/2 


2/2 


fp absolute 


F 


8F 


28FF 


fpremfirst 


36/46 


36/46 


fp remainder first step 


F 


90 


29F0 


fpremstep 


32/36 


32/36 


fp remainder iteration 




01 


s 


fpusqrtfirst 


27/29 


27/29 


fp square root first step 


F 


02 


s 


fpusqrtstep 


42/42 


42/42 


fp square root step 




03 


s 


fpusqrtlast 


8/9 


8/9 


fp square root end 




OA 


s 


fpuexpinc32 


6/9 


6/9 


multiply by 2^^ 


F 


09 


s 


fpuexpdec32 


6/9 


6/9 


divide by 2^^ 


F 


12 


s 


fpumulby2 


6/9 


6/9 


multiply by 2.0 


F 


11 


s 


fpudivby2 


6/9 


6/9 


divide by 2.0 


F 



Processor cycles are shown as Typical/Maximum cycles. 
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5 Floating point unit 

The 64 bit FPU provides single and double length arithmetic to floating point standard ANSI-IEEE 754-1985. 
It is able to perform floating point arithmetic concurrently with the central processor unit (CPU), sustaining 
3.3 Mflops on a 30 MHz device. All data communication between memory and the FPU occurs under control 
of the CPU. 

The FPU consists of a microcoded computing engine with a three deep floating point evaluation stack for 
manipulation of floating point numbers. These stack registers are FA, FB and FC, each of which can hold 
either 32 bit or 64 bit data; an associated flag, set when a floating point value is loaded, indicates which. The 
stack behaves in a similar manner to the CPU stack (page 52). 

As with the CPU stack, the FPU stack is not saved when rescheduling (page 55) occurs. The FPU can 
be used in both low and high priority processes. When a high priority process interrupts a low priority one 
the FPU state is saved inside the FPU. The CPU will service the interrupt immediately on completing its 
current operation. The high priority process will not start, however, before the FPU has completed its current 
operation. 

Points in an instruction stream where data need to be transferred to or from the FPU are called synchronisation 
points. At a synchronisation point the first processing unit to become ready will wait until the other is ready. 
The data transfer will then occur and both processors will proceed concurrently again. In order to make 
full use of concurrency, floating point data source and destination addresses can be calculated by the CPU 
whilst the FPU is performing operations on a previous set of data. Device performance is thus optimised by 
minimising the CPU and FPU idle times. 

The FPU has been designed to operate on both single length (32 bit) and double length (64 bit) floating 
point numbers, and returns results which fully conform to the ANSI-IEEE 754-1985 floating point arithmetic 
standard. Denormalised numbers are fully supported in the hardware. All rounding modes defined by the 
standard are implemented, with the default being round to nearest. 

The basic addition, subtraction, multiplication and division operations are performed by single instructions. 
However, certain less frequently used floating point instructions are selected by a value in register A (when 
allocating registers, this should be taken into account). A load constant instruction Idc is used to load 
register A; the floating point entry instruction fpentry then uses this value to select the floating point operation. 
This pair of instructions is termed a selector sequence. 

Names of operations which use fpentry begin with fpu. A typical usage, returning the absolute value of a 
floating point number, would be 

Idc fpuabs; fpentry; 

Since the indirection code for fpuabs is OB, it would be encoded as 



Table 5.1 /"peA?//-/ coding 



Mnemonic 



Function 
code 



Memory 
code 



Idc 

fpentry 
is coded as 

pfix 
opr 



fpuabs 

(op. code #AB) 

#A 
#B 



#4 



#2 
#F 



#4B 

#2AFB 

#2A 
#FB 
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The remainder an6 square root instructions take considerably longer than other instructions to complete. In 
order to minimise the interrupt latency period of the transputer they are split up to form instruction sequences. 
As an example, the instruction sequence for a single length square root is 

fpusqrtfirst; fpusqrtstep; fpusqrtstep; fpusqrtlast; 

The FPU has Its own error flag FP.Error. This reflects the state of evaluation within the FPU and is set in 
circumstances where invalid operations, division by zero or overflow exceptions to the ANSI-IEEE 754-1985 
standard would be flagged (page 61). FP.Error\s also set if an input to a floating point operation is infinite 
or is not a number (NaN). The FP_Error flag can be set, tested and cleared without affecting the main Error 
flag, but can also set Error when required (page 61). Depending on how a program is compiled, it is possible 
for both unchecked and fully checked floating point arithmetic to be performed. 

Further details on the operation of the FPU can be found in Transputer Instruction Set - A Compiler Writers' 
Guide. 



Table 5.2 Typical floating point operation times for IMS T805 



Operation 


T805-20 


T805-30 


Single length 


Double length 


Single length 


Double length 


add 

subtract 
multiply 
divide 


350 ns 
350 ns 
550 ns 
850 ns 


350 ns 

350 ns 

1000 ns 

1600 ns 


233 ns 
233 ns 
367 ns 
567 ns 


233 ns 
233 ns 
667 ns 

1067 ns 



Timing is for operations where both operands are normalised fp numbers. 
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6 System services 

System services include all the necessary logic to initialise and sustain operation of the device. They also 
include error handling and analysis facilities. 

6.1 Power 

Power is supplied to the device via the VCC and GND pins. Several of each are provided to minimise 
inductance within the package. All supply pins must be connected. The supply must be decoupled close to 
the chip by at least one 100 nF low inductance (e.g. ceramic) capacitor between VCC and GND. Four layer 
boards are recommended; if two layer boards are used, extra care should be taken in decoupling. 

Input voltages must not exceed specification with respect to VCC and GND, even during power-up and power- 
down ramping, otherwise latchup can occur. CMOS devices can be permanently damaged by excessive 
periods of latchup. 

6.2 CapPlus, CapMinus 

The internally derived power supply for internal clocks requires an external low leakage, low inductance 1/xF 
capacitor to be connected between CapPlus and CapMinus. A ceramic capacitor is preferred, with an 
impedance less than 3 Ohms between 100 KHz and 10 MHz. if a polarised capacitor is used the negative 
terminal should be connected to CapMinus. Total RGB track length should be less than 50 mm. The 
connections must not touch power supplies or other noise sources. 
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Figure 6.1 Recommended PLL decoupling 



6.3 Clockln 



Transputer family components use a standard clock frequency, supplied by the user on the Clockln input. 
The nominal frequency of this clock for all transputer family components is 5 MHz, regardless of device type, 
transputer word length or processor cycle time. High frequency internal clocks are derived from Clockln, 
simplifying system design and avoiding problems of distributing high speed clocks externally. 

A number of transputer devices may be connected to a common clock, or may have individual clocks providing 
each one meets the specified stability criteria. In a multi-clock system the relative phasing of Clockln clocks 
is not important, due to the asynchronous nature of the links. Mark/space ratio is unimportant provided the 
specified limits of Clockln pulse widths are met. 

Oscillator stability is important. Clockln must be derived from a crystal oscillator; RG oscillators are not 
sufficiently stable. Clockln must not be distributed through a long chain of buffers. Clock edges must be 
monotonic and remain within the specified voltage and time limits. 
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Table 6.1 Input clock 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TDCLDCH 


Clockln pulse width low 


40 






ns 


1 


TDCHDCL 


Clockln pulse width high 


40 






ns 


1 


TDCLDCL 


Clockln period 




200 




ns 


1,2.4 


TDCerror 


Clockln timing error 






±0.5 


ns 


1,3 


TDC1DC2 


Difference in Clockln for 2 linked devices 






400 


ppm 


1.4 


TDCr 


Clockln rise time 






10 


ns 


1,5 


TDCf 


Clockln fall time 






8 


ns 


1.5 



Notes 



1 These parameters are not tested, 

2 Measured between corresponding points on consecutive falling edges. 

3 Variation of individual falling edges from their nominal times. 

4 This value allows the use of 200ppm crystal oscillators for two devices connected together by a link. 

5 Clock transitions must be monotonic within the range VIH to VIL (table 11. 3). 
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Figure 6.2 Clockln timing 



6.4 



ProcSpeedSelectO-2 



Processor speed of the IMS T805 is variable in discrete steps. The desired speed can be selected, up to the 
maximum rated for a particular component, by the three speed select lines ProcSpeedSelectO-2. The pins 
are tied high or low, according to the table below, for the various speeds. The pins are arranged so that the 
IMS T805 can be plugged directly into a board designed for a IMS T425. 

Only six of the possible speed select combinations are currently used; the other two are not valid speed 
selectors. The frequency of Clockln for the speeds given in the table is 5 MHz. 
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Table 6.2 Processor speed selection 



Proc 
Speed 
Select2 


Proc 
Speed 
Selecti 


Proc 
Speed 
SelectO 


Processor 

Clock 
Speed MHz 


Processor 

Cycle 
Time ns 


Notes 











20.0 


50.0 










1 


22.5 


44.4 







1 





25.0 


40.0 







1 


1 


30.0 


33.3 




1 








35.0 


28.6 




1 





1 






Invalid 


1 


1 





17.5 


57.1 




1 


1 


1 






Invalid 



Note: Inclusion of a speed selection in this table does not imply immediate availability. 



6.5 



Reset 



Reset can go high with VCC, but must at no time exceed the maximum specified voltage for VIM. After VCC is 
valid Clockln should be running for a minimum period TDCVRL before the end of Reset. The falling edge of 
Reset initialises the transputer, triggers the memory configuration sequence and starts the bootstrap routine. 
Link outputs are forced low during reset; link inputs and EventReq should be held low. Memory request 
(DMA) must not occur whilst Reset is high but can occur before bootstrap (page 94). 

After the end of Reset there will be a delay of 144 periods of Clockln (figure 6.3). Following this, the 
MemWrDO, MemRfDI and MemAD2-31 pins will be scanned to check for the existence of a pre-programmed 
memory interface configuration (page 96). This lasts for a further 144 periods of Clockln. Regardless of 
whether a configuration was found, 36 configuration read cycles will then be performed on external memory 
using the default memory configuration (page 98), in an attempt to access the external configuration ROM. 
A delay will then occur, its period depending on the actual configuration. Finally eight complete and con- 
secutive refresh cycles will initialise any dynamic RAM, using the new memory configuration. If the memory 
configuration does not enable refresh of dynamic RAM the refresh cycles will be replaced by an equivalent 
delay with no external memory activity. 

If BootFromRom is high bootstrapping will then take place immediately, using data from external memory; 
othenA/ise the transputer will await an input from any link. The processor will be in the low priority state. 



Reset 

Action 



Delay 



Internal 



External 



configuration configuration 



Delay 



Refresh 



Boot 



Figure 6.3 IMS T805 post-reset sequence 



6.6 



Bootstrap 



The transputer can be bootstrapped either from a link or from external ROM. To facilitate debugging, Boot- 
FromRom may be dynamically changed but must obey the specified timing restrictions. It is sampled once 
only by the transputer, before the first instruction is executed after Reset is taken low. 

If BootFromRom is connected high (e.g. to VCC) the transputer starts to execute code from the top two bytes 
in external memory, at address #7FFFFFFE. This location should contain a backward jump to a program in 
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ROM. Following this access, BootFromRom may be taken low If required. The processor is in the low priority 
state, and the IV register points to MemStart (page 77). 



Table 6.3 Reset and Analyse 



SYIVIBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPVRH 


Power valid before Reset 


10 






ms 




TRHRL 


Reset pulse width high 


8 






Clockln 


1 


TDCVRL 


Clockln running before Reset end 


10 






ms 


2 


TAHRH 


Analyse setup before Reset 


3 






ms 




TRLAL 


Analyse hold after Reset end 


1 






Clockln 


1 


TBRVRL 


BootFromRom setup 









ms 




TRLBRX 


BootFromRom hold after Reset 









ms 


3 


TALBRX 


BootFromRom hold after Analyse 










3 



Notes 



1 Full periods of Clockln TDCLDCL required. 

2 At power-on reset. 

3 Must be stable until after end of bootstrap period. See Bootstrap section. 
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Figure 6.4 Transputer reset timing with Analyse low 



Reset 
Analyse 

BootFromRom ~ 






TRHRL 


^ . . I 


/ 


^ ^ 




TAHRH ^ 

) 






^ TRLAL // 


Y 




TBRVRL^ 




TALBRx// 


>- 






^ // 


/ 


^ JJ 


\ 
\ 













Figure 6.5 Transputer reset and analyse timing 
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If BootFromRom is connected low (e.g. to GND) the transputer will wait for the first bootstrap nnessage to 
arrive on any one of its links. The transputer is ready to receive the first byte on a link within two processor 
cycles TPCLPCL after Reset goes low. 

If the first byte received (the control byte) is greater than 1 it is taken as the quantity of bytes to be input. The 
following bytes, to that quantity, are then placed in internal memory starting at location MemStart Following 
reception of the last byte the transputer will start executing code at MemStart as a low priority process. 
BootFromRom may be taken high after reception of the last byte, if required. The memory space immediately 
above the loaded code is used as work space. Messages arriving on other links after the control byte has 
been received and on the bootstrapping link after the last bootstrap byte will be retained until a process inputs 
from them. 

6.7 Peek and poke 

Any location in internal or external memory can be interrogated and altered when the transputer is waiting 
for a bootstrap from link. If the control byte is then eight more bytes are expected on the same link. The 
first four byte word is taken as an internal or external memory address at which to poke (write) the second 
four byte word. If the control byte is 1 the next four bytes are used as the address from which to peek (read) 
a word of data; the word is sent down the output channel of the same link. 

Following such a peek or poke, the transputer returns to its previously held state. Any number of accesses 
may be made in this way until the control byte is greater than 1 , when the transputer will commence reading 
its bootstrap program. Any link can be used, but addresses and data must be transmitted via the same link 
as the control byte. 

6.8 Analyse 

If Analyse is taken high when the transputer is running, the transputer will halt at the next descheduling 
point (page 60). From Analyse being asserted, the processor will halt within three time slice periods plus 
the time taken for any high priority process to complete. As much of the transputer status is maintained as is 
necessary to permit analysis of the halted machine. Processor flags Error, HaltOnError and EnableJOBreak 
are normally cleared at reset on the IMS T805; however, if Analyse is asserted the flags are not altered. 
Memory refresh continues. 

input links will continue with outstanding transfers. Output links will not make another access to memory 
for data but will transmit only those bytes already in the link buffer. Providing there is no delay in link 
acknowledgement, the links should be inactive within a few microseconds of the transputer halting. 

Reset should not be asserted before the transputer has halted and link transfers have ceased. When Reset 
is taken low whilst Analyse is high, neither the memory configuration sequence nor the block of eight refresh 
cycles will occur; the previous memory configuration will be used for any external memory accesses. If 
BootFromRom is high the transputer will bootstrap as soon as Analyse is taken low, otherwise it will await a 
control byte on any link. If Analyse is taken low without Reset going high the transputer state and operation 
are undefined. After the end of a valid Analyse sequence the registers have the values given in table 6.4. 

Table 6.4 Register values after Analyse 



/ MemStart if bootstrapping from a link, or the external memory bootstrap address if 

bootstrapping from ROM. 

W MemStart if bootstrapping from ROM, or the address of the first free word after the 
bootstrap program if bootstrapping from link. 

A The value of /when the processor halted. 

B The value of I// when the processor halted, together with the priority of the process 
when the transputer was halted (i.e. the l^ descriptor). 

C The ID of the bootstrapping link if bootstrapping from link. 
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6.9 



Error, Errorin 



The Error pin carries the OR'ed output of the internal Error flag and the Errorin input. If Error is high 
it indicates either that Errorin is high or that an error was detected in one of the processes. An internal 
error can be caused, for example, by arithmetic overflow, divide by zero, array bounds violation or software 
setting the flag directly (page 61). It can also be set from the floating point unit under certain circumstances 
(page 61, 69). Once set, the E/ror flag is only cleared by executing the instruction testerr. The error is not 
cleared by processor reset, in order that analysis can identify any errant transputer (page 75). 

A process can be programmed to stop if the Error flag is set; it cannot then transmit erroneous data to other 
processes, but processes which do not require that data can still be scheduled. Eventually all processes 
which rely, directly or indirectly, on data from the process in error will stop through lack of data. Errorin does 
not directly affect the status of a processor in any way. 

By setting the HaltOn Error ^lag the transputer itself can be programmed to halt if Error becomes set. If Error 
becomes set after HaltOn Error has been set, all processes on that transputer will cease but will not necessarily 
cause other transputers in a network to halt. Setting HaltOn Error aher Error w\\\ not cause the transputer to 
halt; this allows the processor reset and analyse facilities to function with the flags in indeterminate states. 

An alternative method of error handling is to have the errant process or transputer cause all transputers 
to halt. This can be done by 'daisy-chaining' the Errorin and Error pins of a number of processors and 
applying the final Error output signal to the EventReq pin of a suitably programmed master transputer. Since 
the process state is preserved when stopped by an error, the master transputer can then use the analyse 
function to debug the fault. When using such a circuit, note that the Error flag is in an indeterminate state on 
power up; the circuit and software should be designed with this in mind. 

Error checks can be removed completely to optimise the performance of a proven program; any unexpected 
error then occurring will have an arbitrary undefined effect. 

If a high priority process pre-empts a low priority one, status of the Error and HaltOnError flags is saved for 
the duration of the high priority process and restored at the conclusion of it. Status of both flags is transmitted 
to the high priority process. Either flag can be altered in the process without upsetting the error status of any 
complex operation being carried out by the pre-empted low priority process. 

In the event of a transputer halting because of HaltOnErron the links will finish outstanding transfers before 
shutting down. If Analyse is asserted then all inputs continue but outputs will not make another access to 
memory for data. Memory refresh will continue to take place. 

After halting due to the Error flag changing from to 1 whilst HaltOnError \s set, register /points two bytes 
past the instruction which set Error. After halting due to the Analyse pin being taken high, register / points 
one byte past the instruction being executed. In both cases /will be copied to register A. 
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Figure 6.6 Error handling in a multi-transputer system 
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7 Memory 

The IMS T805 has 4 Kbytes of fast internal static memory for high rates of data throughput. Each inter- 
nal memory access takes one processor cycle ProcClockOut (page 82). The transputer can also access 
4 Gbytes of external memory space. Internal and external memory are part of the same linear address space. 
Internal RAM can be disabled by holding DisablelntRAM high. All internal addresses are then mapped to 
external RAM. This pin should not be altered after Reset has been taken low. 

IMS T805 memory is byte addressed, with words aligned on four-byte boundaries. The least significant byte 
of a word is the lowest addressed byte. 

The bits in a byte are numbered to 7, with bit the least significant. The bytes are numbered from 0, with 
byte the least significant. In general, wherever a value is treated as a number of component values, the 
components are numbered in order of increasing numerical significance, with the least significant component 
numbered 0. Where values are stored in memory, the least significant component value is stored at the 
lowest (most negative) address. 

Internal memory starts at the most negative address #80000000 and extends to #80000FFF. User memory 
begins at #80000070; this location is given the name MemStart. An instruction Idmemstartval is provided to 
obtain the value of MemStart. 

The context of a process in the transputer model involves a workspace descriptor (WPtr) and an instruction 
pointer (IPtr). WPtr is a word address pointer to a workspace in memory. IPtr points to the next instruction to 
be executed for the process which is the currently executing process. The context switch performed by the 
breakpoint instruction swaps the WPtr and IPtr of the currently executing process with the WPtr and IPtr held 
above MemStart. Two contexts are held above MemStart, one for high priority and one for low priority; this 
allows processes at both levels to have breakpoints. Note that on bootstrapping from a link, these contexts 
are overwritten by the loaded code. If this is not acceptable, the values should be peeked from memory 
before bootstrapping from a link. 

The reserved area of internal memory below MemStart is used to implement link and event channels. 

Two words of memory are reserved for timer use, TPtrLocO \or high priority processes and TPtrLod for low 
priority processes. They either indicate the relevant priority timer is not in use or point to the first process on 
the timer queue at that priority level. 

Values of certain processor registers for the current low priority process are saved in the reserved IntSaveLoc 
locations when a high priority process pre-empts a low priority one. Other locations are reserved for extended 
features such as block moves and floating point operations. 

External memory space starts at #80001000 and extends up through #00000000 to #7FFFFFFF. Memory 
configuration data and ROM bootstrapping code must be in the most positive address space, starting at 
#7FFFFF6C and #7FFFFFFE respectively. Address space immediately below this is conventionally used for 
ROM based code. 
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Figure 7.1 IMS T805 memory map 
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8 External memory interface 

The External Memory Interface (EMI) allows access to a 32 bit address space, supporting dynamic and static 
RAM as well as ROM and EPROM. EMI timing can be configured at Reset to cater for most memory types 
and speeds, and a program is supplied with the Transputer Development System to aid in this configuration. 

There are 17 internal configurations which can be selected by a single pin connection (page 96). If none are 
suitable the user can configure the interface to specific requirements, as shown in page 98. 

The timing parameters in the following tables are based on 17 MHz and 20 MHz parts. Data for higher speeds 
is based on tests on a limited number of samples and may change when full characterisation is completed. 

The external memory cycle is divided into six Tstates with the following functions: 

T1 Address setup time before address valid strobe. 

T2 Address hold time after address valid strobe. 

T3 Read cycle tristate or write cycle data setup. 

T4 Extendable data setup time. 

15 Read or write data. 

16 Data hold. 

Under normal conditions each Tstate may be from one to four periods Tm long, the duration being set during 
memory configuration. The default condition on Reset is that all Tstates are the maximum four periods Tm 
long to allow external initialisation cycles to read slow ROM. 

Period T4 can be extended indefinitely by adding externally generated wait states. 

An external memory cycle is always an even number of periods Tm in length and the start of T1 always 
coincides with a rising edge of ProcClockOut. If the total configured quantity of periods Tm is an odd 
number, one extra period Tm will be added at the end of T6 to force the start of the next T1 to coincide with 
a rising edge of ProcClockOut. This period is designated E in configuration diagrams (figure 8.19). 

During an internal memory access cycle the external memory interface bus MemAD2-31 reflects the word 
address used to access internal RAM, MemnotWrDO reflects the read/write operation and MemnotRfDI is 
high; all control strobes are inactive. This is true unless and until a memory refresh cycle or DMA (memory 
request) activity takes place, when the bus will carry the appropriate external address or data. 

The bus activity Is not adequate to trace the internal operation of the transputer in full, but may be used for 
hardware debugging in conjuction with peek and poke (page 75). 
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Figure 8.1 IMS T805 bus activity for internal memory cycle 
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8.1 Pin functions 

8.1.1 l\/lemAD2-31 

External memory addresses and data are multiplexed on one bus. Only the top 30 bits of address are output 
on the external memory interface, using pins l\/lemAD2-31. They are normally output only during Tstates T1 
and T2, and should be latched during this time. The data bus is 32 bits wide. It uses l\/lemAD2-31 for the 
top 30 bits and IVIemnotRfDI and MemnotWrDO for the lower two bits. 

8.1.2 notlVlemRd 

For a read cycle the read strobe notMemRd is low during 14 and 15. Data is read by the transputer on the 
rising edge of this strobe, and may be removed immediately afterward. If the strobe duration is insufficient it 
may be extended by adding extra periods Tm to either or both of the Tstates T4 and T5. Further extension 
may be obtained by inserting wait states at the end of T4. 

8.1.3 IVIemnotWrDO 

During T1 ahd T2 this pin will be low if the cycle is a write cycle, otherwise it will be high. During Tstates T3 
to T6 it becomes bit of the data bus. In both cases it follows the general timing of l\/IemAD2-31. 

8.1.4 notMemWrBO-3 

Because the transputer uses word addressing, four write strobes are provided; one to write each byte of the 
word. notMemWrBO addresses the least significant byte. 

8.1.5 not[\/lemS0-4 

To facilitate control of different types of memory and devices, the EMI is provided with five strobe outputs, 
four of which can be configured by the user. The strobes are conventionally assigned the functions shown in 
the read and write cycle diagrams, although there is no compulsion to retain these designations. 

8.1.6 IVIemWait 

Wait states can be selected by taking IVIemWait high. Externally generated wait states can be added to 
extend the duration of T4 indefinitely. 

8.1.7 MemnotRfDI 

During T1 and T2, this pin is low if the address on MemAD2-31 is a refresh address, otherwise it is high. 
During Tstates T3 to T6 it becomes bit 1 of the data bus. In both cases it follows the general timing of 
MemAD2-31. 

8.1.8 notMemRf 

The IMS T805 can be operated with memory refresh enabled or disabled. The selection is made during 
memory configuration, when the refresh interval is also determined. 

8.1.9 RefreshPending 

When high, this pin signals that a refresh cycle is pending. 
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Figure 8.2 IMS T805 dynamic RAM application 

8.1.10 MemReq, MemGranted 

Direct memory access (DMA) can be requested at any time by driving the asynchronous MemReq input high. 

MemGranted follows the timing of the bus being tristated and can be used to signal to the device requesting 
the DMA that it has control of the bus. Note that MemGranted changes on the falling edge of ProcClockOut 
and can therefore be sampled to establish control of the bus on the rising edge of ProcClockOut. 



8.1.11 MemConfig 

MemConfig is an input pin used to read configuration data when setting external memory interface (EMI) 
characteristics. 
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8.1 .12 ProcClockOut 

This clock is derived from the internal processor clock, which is in turn derived from Clockln. Its period is 
equal to one internal microcode cycle time, and can be derived from the formula 

TPCLPCL = TDCLDCL / PLLx 

where TPCLPCL is the ProcClocl<Out Period, TDCLDCL is the Clocl<ln Period and PLLx is the phase 
lock loop factor for the relevant speed part, obtained from the ordering details (Ordering section). 

The time value Tm is used to define the duration of Tstates and, hence, the length of external memory cycles; 
its value is exactly half the period of one ProcCloclcOut cycle (0.5*TPCLPCL), regardless of mark/space 
ratio of ProcClockOut. 

Edges of the various external memory strobes coincide with rising or falling edges of ProcClockOut. It should 
be noted, however, that there is a skew associated with each coincidence. The value of skew depends on 
whether coincidence occurs when the ProcClockOut edge and strobe edge are both rising, when both are 
falling or if either is rising when the other is falling. Timing values given in the strobe tables show the best 
and worst cases. If a more accurate timing relationship is required, the exact Tstate timing and strobe edge 
to ProcClockOut relationships should be calculated and the correct skew factors applied from the edge skew 
timing table 8.4. 

Table 8.1 ProcClockOut 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPCLPCL 


ProcClockOut period 


a-2 


a 


a+2 


ns 


1.5 


TPCHPCL 


ProcClockOut pulse width high 


b-11.5 


b 


b+3.5 


ns 


2,5 


TPCLPCH 


ProcClockOut pulse width low 









ns 


3,5 


Tm 


ProcClockOut half cycle 


b-1 


b 


b+1 


ns 


2,5 


TPCstab 


ProcClockOut stability 






8 


% 


4,5 



Notes 



1 a is TDCLDCL/PLLx. 

2 b is 0.5+TPCLPCL (half the processor clock period). 

3 c is TPCLPCL-TPCHPCL 

4 Stability is the variation of cycle periods between two consecutive cycles, measured at corresponding points on 
the cycles. 



5 This parameter is sampled and not 100% tested. 
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Figure 8.3 IMS T805 ProcClockOut timing 
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8.2 



Read cycle 



Byte addressing is carried out internally by the transputer for read cycles. For a read cycle the read strobe 
notMemRd is low during T4 and T5. Read cycle data may be set up on the data bus at any time after the 
start of T3, but must be valid when the transputer reads it at the end of T5. Data may be removed any time 
during T6, but must be off the bus no later than the end of that period. 

notMemSO Is a fixed format strobe. Its leading edge is always coincident with the start of T2 and its trailing 
edge always coincident with the end of T5. 

The leading edge of notMemSI is always coincident with the start of T2, but its duration may be configured 
to be from zero to 31 periods Tm. Regardless of the configured duration, the strobe will terminate no later 
than the end of T6. The strobe is sometimes programmed to extend beyond the normal end of Tmx. When 
wait states are inserted into an EMI cycle the end of Tmx is delayed, but the potential active duration of the 
strobe is not altered. Thus the strobe can be configured to terminate relatively early under certain conditions 
(page 89). If notMemSI is configured to be zero it will never go low. 

notMemS2, notMemS3 and notMemS4 are identical in operation. They all terminate at the end of T5, but 
the start of each can be delayed from one to 31 periods Tm beyond the start of T2. If the duration of one of 
these strobes would take it past the end of T5 it will stay high. This can be used to cause a strobe to become 
active only when wait states are inserted. If one of these strobes is configured to zero it will never go low. 
Figure 8.6 shows the effect of Wait on strobes in more detail; each division on the scale is one period Tm. 

In the read cycle timing diagrams ProcClockOut is included as a guide only; it is shown with each Tstate 
configured to one period Tm. 

Table 8.2 Read 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TaZdV 


Address tristate to data valid 









ns 




TdVRdH 


Data setup before read 


25 






ns 




TRdHdX 


Data hold after read 









ns 




TSOLRdL 


notMemSO before start of read 


a-4 


a 


a+4 


ns 


1 


TSOHRdH 


End of read from end of notMemSO 


-4 




4 


ns 




TRdLRdH 


Read period 


b-3 




b+5 


ns 


2 



Notes 



1 a is total of T2+T3 where T2, T3 can be from one to four periods Tm each in length. 

2 b is total of T4+Twalt+T5 where T4, T5 can be from one to four periods Tm each in length and Twalt may be 
any number of periods Tm in length. 
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V«««««<( 
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TaVSOL TSOLaX 
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notlVlemSO 

(CE) 
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Figure 8.4 IMS T805 external read cycle: static memory 
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Table 8.3 IMS T805 strobe timing 



SYMBOL 


Cn) 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TaVSOL 




Address setup before notMemSO 


a-8 






ns 


1 


TSOLaX 




Address hold after notMemSO 


b-8 


b 


b+8 


ns 


2 


TSOLSOH 




notMemSO pulse width low 


0-5 




0+6 


ns 


3 


TS0LS1L 


1 


notMemSI from notMemSO 


-4 




4 


ns 




TS0LS1H 


5 


notMemSI end from notMemSO 


d-1 




d+9 


ns 


4,6 


TS0HS1H 


9 


notMemSI end from notMemSO end 


e-8 




e+4 


ns 


5,6 


TS0LS2L 


2 


notMemS2 delayed after notMemSO 


f-6 




f+5 


ns 


7 


TS0LS2H 


6 


notMemS2 end from notMemSO 


0-5 




0+7 


ns 


3 


TS0HS2H 


10 


notMemS2 end from notMemSO end 


-4 




7 


ns 




TS0LS3L 


3 


notMemS3 delayed after notMemSO 


f-6 




f+5 


ns 


7 


TS0LS3H 


7 


notMemS3 end from notMemSO 


0-5 




0+7 


ns 


3 


TS0HS3H 


11 


notMemS3 end from notMemSO end 


-4 




7 


ns 




TS0LS4L 


4 


notMemS4 delayed after notMemSO 


f-6 




f+5 


ns 


7 


TS0LS4H 


8 


notMemS4 end from notMemSO 


0-5 




0+7 


ns 


3 


TS0HS4H 


12 


notMemS4 end from notMemSO end 


-4 




7 


ns 




Tmx 




Complete external memory cycle 




g 






8 



Notes 



1 a is T1 where T1 can be from one to four periods Tm in length. 

2 b is T2 where T2 can be from one to four periods Tm in length. 

3 c is total of T2+T3+T4+Twalt+T5 where T2, T3, T4, T5 can be from one to four periods Tm each in length and 
Twalt may be any number of periods Tm in length. 

4 d can be from zero to 31 periods Tm in length. 

5 e can be from -27 to +4 periods Tm in length. 

6 If the configuration would cause the strobe to remain active past the end of T6 it will go high at the end of T6. 
If the strobe is configured to zero periods Tm it will remain high throughout the complete cycle Tmx. 

7 f can be from zero to 31 periods Tm in length. If this length would cause the strobe to remain active past the 
end of T5 it will go high at the end of T5. If the strobe value is zero periods Tm it will remain low throughout 
the complete cycle Tmx. 

8 g is one complete external memory cycle comprising the total of T1+T2+T3+T4+Twait+T5+T6 where T1, T2, 
T3, T4, T5 can be from one to four periods Tm each in length, T6 can be from one to five periods Tm in length 
and Twait may be zero or any number of periods Tm in length. 
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Figure 8.5 IMS T805 external read cycle: dynamic memory 
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Tstate|T1 |T2|T3|T4|T5|T6|T1 | 
notMemSI | 1 


Tstate|Tl|T2|T3|T4|w|W|T5|T6|Tl| 


notMemSI 1 1 


1 


notMemS2 

No wait states 


not Mem S2 | | 


Wait states inserted 



Figure 8.6 IMS T805 effect of wait states on strobes 



Table 8.4 Strobe SO to ProcClockOut skew 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPCHSOH 
TPCLSOH 
TPCHSOL 
TPCLSOL 


notMemSO rising from ProcClockOut rising 
notMemSO rising from ProcClockOut falling 
notMemSO falling from ProcClockOut rising 
notMemSO falling from ProcClockOut falling 


-6 
-5 
-8 
-5 




4 
10 
3 

7 


ns 
ns 
ns 
ns 





ProcClockOut 



NotMemSO 



J h^ TPCHSOH J |-^T 



TPCHSOL 



rTPCLSOH 



^TPCLSOL 



Figure 8.7 IMS T805 skew of notMemSO to ProcClockOut 
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4 IMS T805 engineering data 



8.3 



Write cycle 



For write cycles the relevant bytes in memory are addressed by the write strobes notMemWrBO-3. 
particular byte is not to be written, then the corresponding data outputs are tristated. 



If a 



For a write cycle pin MemnotWrDO will be low during T1 and T2. Write data is placed on the bus at the start 
of T3 and removed at the end of T6. If T6 is extended to force the next cycle Tmx (page 80) to start on a 
rising edge of ProcClockOut, data will be valid during this time also. 

The transputer has both early and late write cycle modes. For a late write cycle the relevant write strobes 
notMemWrBO-3 are low during T4 and T5; for an early write they are also low during T3. Data should be 
latched into memory on the rising edge of the strobes in both cases, although it is valid until the end of T6. 
If the strobe duration is insufficient, it may be extended at configuration time by adding extra periods Tm to 
either or both of Tstates T4 and T5 for both early and late modes. For an early cycle they may also be added 
to T3. Further extension may be obtained by inserting wait states at the end of T4. If the data hold time is 
insufficient, extra periods Tm may be added to T6 to extend it. 

In the write cycle timing diagram ProcClockOut is included as a guide only; it is shown with each Tstate 
configured to one period Tm. The strobe is inactive during internal memory cycles. 



Table 8.5 Write 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TdVWrH 


Data setup before write 


d-7 




d+10 


ns 


1.5 


TWrHdX 


Data hold after write 


a-10 




a+5 


ns 


1.2 


TSOLWrL 


notMemSO before start of early write 


b-5 




b+5 


ns 


1.3 




notMemSO before start of late write 


0-5 




0+5 


ns 


1.4 


TSOHWrH 


End of write from end of notMemSO 


-5 




4 


ns 


1 


TWrLWrH 


Early write pulse width 


d-4 




d+7 


ns 


1,5 




Late write pulse width 


e-4 




e+7 


ns 


1,6 



Notes 



1 Timing is for all write strobes notMemWrBO-3. 

2 a is T6 where T6 can be from one to five periods Tm in length. 

3 b is T2 where T2 can be from one to four periods Tm in length. 

4 c is total of T2+T3 where T2, T3 can be from one to four periods Tm each in length. 

5 d is total of T3+T4+Twait+T5 where T3, T4, T5 can be from one to four periods Tm each in length and Twait 
may be zero or any number of periods Tm in length. 

6 e is total of T4+Twait+T5 where T4, T5 can be from one to four periods Tm each in length and Twait may be 
zero or any number of periods Tm in length. 
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Figure 8.8 IMS T805 external write cycle 



8.4 



Wait 



Taking MemWait high with the timing shown (figure 8.9) will extend the duration of T4. IVlemWait is sampled 
relative to the falling edge of ProcClockOut during a T3 period, and should not change state in this region. 
By convention, notMemS4 is used to synchronize wait state insertion. If this or another strobe is used, its 
delay should be such as to take the strobe low an even number of periods Tm after the start of T1, to coincide 
with a rising edge of ProcClockOut. 

MemWait may be kept high indefinitely, although if dynamic memory refresh is used it should not be kept 
high long enough to interfere with refresh timing. MemWait operates normally during all cycles, including 
refresh and configuration cycles. It does not affect internal memory access in any way. 

If the start of T5 would coincide with a falling edge of ProcClockOut an extra wait period Tm (EW) is 
generated by the EMI to force coincidence with a rising edge. Rising edge coincidence is only forced if wait 
states are added, otherwise coincidence with a falling edge is permitted. 
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4 IMS T805 engineering data 



Table 8.6 Memory wait 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPCLWtH 
TPCLWtL 
TWtLWtH 


Wait setup 

Wait hold 

Delay before re-assertion of Wait 


-(0.5Tm+9) 

0.5Tm+10 

2Tm 






ns 
ns 


1.2 
1,2 



Notes 



1 ProcClockOut load should not exceed 50pf. 

2 If wait period exceeds refresh interval, refresh cycles will be lost. 



Tstate I T2 I T3 I T4 I^W I T5 I T6 I T1 | 
ProcClockOut / V / 
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Figure 8.9 IMS T805 memory wait timing 
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8.5 



Memory refresh 



The RefreshPending pin is asserted high when the external memory interface is about to perform a refresh 
cycle. It remains high until the refresh cycle is started by the transputer. The mimimum time for the Re- 
freshPending pin to be high is for one cycle of ProcClockOut (two periods Tm), when the EMI was not 
about to perform a memory read or write. If the EMI was held in the tristate condition with IVIemGranted 
asserted, then RefreshPending will be asserted when the refresh controller in the EMI is ready to perform 
a refresh. IVIemReq may be re-asserted any time after the commencement of the refresh cycle. Refresh- 
Pending changes state near the rising edge of ProcCloclcOut and can therefore be sampled by the falling 
edge of ProcClockOut. 

If no DMA is active then refresh will be performed following the end of the current internal or external memory 
cycle. If DMA is active the transputer will wait for DMA to terminate before commencing the refresh cycle. 
Unlike MemnotRfDI, RefreshPending is never tristated and can thus be interrogated by the DMA device; 
the DMA cycle can then be suspended, at the discretion of the DMA device, to allow refresh to take place. 

The simple circuit of Figure 8.10 will suspend DMA requests from the external logic when RefreshPending 
is asserted, so that a memory refresh cycle can be performed. DMA is restored on completion of the refresh 
cycle. The transputer will not perform an external memory cycle other than a refresh cycle, using this method, 
until the requesting device removes its DMA request. 
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Figure 8.10 IMS T805 refresh with DMA 



When refresh is disabled no refresh cycles occur. During the post-Reset period eight dummy refresh cycles 
will occur with the appropriate timing but with no bus or strobe activity. 

A refresh cycle uses the same basic external memory timing as a normal external memory cycle, except that 
it starts two periods Tm before the start of T1. If a refresh cycle is due during an external memory access, 
it will be delayed until the end of that external cycle. Two extra periods Tm (periods R in the diagram) will 
then be inserted between the end of T6 of the external memory cycle and the start of T1 of the refresh cycle 
itself. The refresh address and various external strobes become active approximately one period Tm before 
T1. Bus signals are active until the end of T2, whilst notMemRf remains active until the end of T6. 

For a refresh cycle, MemnotRfDI goes low before notMemRf goes low and MemnotWrDO goes high with 
the same timing as MemnotRfDI. All the address lines share the same timing, but only MemAD2-11 give 
the refresh address. Mem AD1 2-30 stay high during the address period, whilst MemAD31 remains low. 
Refresh cycles generate strobes notMemSO-4 with timing as for a normal external cycle, but notMemRd and 
notMemWrBO-3 remain high. MemWait operates normally during refresh cycles. 

Refresh cycles do not interrupt internal memory accesses, although the internal addresses cannot be reflected 
on the external bus during refresh. 
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Table 8.7 Memory refresh 



SYIVIBOL 


PARAIVIETER 


IVIIN 


NOIVI 


MAX 


UNITS 


NOTE 


TRfLRfH 

TRaVSOL 

TRfLSOL 


Refresh pulse width low 

Refresh address setup before notMemSO 

Refresh indicator setup before notMemSO 


a-2 
b-12 
b-4 


b 


a+9 
b+6 


ns 
ns 
ns 


1 
2 



Notes 



1 a is total Tmx+Tm. 

2 b is total T1+Tm where T1 can be from one to four periods Tm in length. 
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Figure 8.1 1 IMS T805 refresh cycle timing 
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Figure 8.12 IMS T805 RefreshPending timing 
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8.6 



Direct memory access 



Direct memory access (DMA) can be requested at any time by taking the asynchronous MemReq input high. 
The transputer samples MemReq during the final period Tm of T6 of both refresh and external memory 
cycles. To guarantee taking over the bus immediately following either, MemReq must be set up at least two 
periods Tm before the end of T6. In the absence of an external memory cycle, MemReq is sampled during 
every low period of ProcClockOut. The address bus is tristated two periods Tm after the ProcClocltOut 
rising edge which follows the sample. MemGranted is asserted one period Tm after that. 

Removal of MemReq is sampled during each low period of ProcClockOut and MemGranted is removed 
synchronously with the next falling edge of ProcClockOut. If accurate timing of DMA is required, MemReq 
should be set low coincident with a falling edge of ProcClockOut. Further external bus activity, either refresh, 
external cycles or reflection of internal cycles, will commence at the next rising edge of ProcClockOut. 

Strobes are left in their inactive states during DMA. DMA cannot interrupt a refresh or external memory cycle, 
and outstanding refresh cycles will occur before the bus is released to DMA. DMA does not interfere with 
internal memory cycles in any way, although a program running in internal memory would have to wait for 
the end of DMA before accessing external memory. DMA cannot access internal memory. If DMA extends 
longer than one refresh interval (Memory Refresh Configuration Coding, table 8.11), the DMA user becomes 
responsible for refresh. DMA may also inhibit an internally running program from accessing external memory. 

DMA allows a bootstrap program to be loaded into external RAM ready for execution after reset. If MemReq is 
held high throughout reset, MemGranted will be asserted before the bootstrap sequence begins. MemReq 
must be high at least one period TDCLDCL of Clockln before Reset. The circuit should be designed to 
ensure correct operation if Reset could interrupt a normal DMA cycle. 

Table 8.8 Memory request 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TMRHMGH 
TMRLMGL 
TADZMGH 
TMGLADV 


Memory request response time 
Memory request end response time 
Bus tristate before memory granted 
Bus active after end of memory granted 


4Tm-2ns 
2Tm-2ns 
Tm-2ns 

-10ns 




7Tm+7ns 

5Tm+22ns 

Tm+22ns 

Tm+2ns 




1 



Notes 



1 These values assume no external memory cycle is in progress. If an external cycle is active, maximum time 
could be (1 EMI cycle Tmx)+(1 refresh cycle TRfLRfH)+(6 periods Tm). 
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Figure 8.13 IMS T805 memory request timing 
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Figure 8.14 IMS T805 DMA sequence at reset 
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Figure 8.15 IMS T805 operation of MemReq, MemGranted with external, refresh memory cycles 
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Figure 8.16 IMS T805 operation of MemReq, MemGranted with external, internal memory cycles 
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8.7 



Memory configuration 



MemConfig is an input pin used to read configuration data when setting external memory interface (EiVII) 
cliaracteristics. It is read by the processor on two occasions after Reset goes low; first to check if one of the 
preset internal configurations is required, then to determine a possible external configuration. 

8.7.1 Internal configuration 

The internal configuration scan comprises 64 periods TDCLDCL of Clockln during the internal scan period 
of 144 Clockln periods. MemnotWrDO, MemnotRfDI and MemAD2-32 are all high at the beginning of the 
scan. Starting with MemnotWrDO, each of these lines goes low successively at intervals of two Clockln 
periods and stays low until the end of the scan. If one of these lines is connected to MemConfig the preset 
internal configuration mode associated with that line will be used as the EMI configuration. The default 
configuration is that defined in the table for MemAD31; connecting MemConfig to VCC will also produce 
this default configuration. Note that only 17 of the possible configurations are valid, all others remain at the 
default configuration. 

Table 8.9 IMS T805 internal configuration coding 





Duration of each Tstate 




Strobe 




Write 


Refresh 


Cycle 








periods Tm 








coefficient 




cycle 


interval 


time 




















Clockln 


Proc 


Pin 


T1 


12 


13 14 


15 


16 


S1 


s2 S3 


S4 


type 


cycles 


cycles 


MemnotWrDO 


1 


1 


1 1 


1 


1 


30 


1 3 


5 


late 


72 


3 


MemnotRfDI 


1 


2 


1 1 


1 


2 


30 


1 2 


7 


late 


72 


4 


MemAD2 


1 


2 


1 1 


2 


3 


30 


1 2 


7 


late 


72 


5 


MemADS 


2 


3 


1 1 


2 


3 


30 


1 3 


8 


late 


72 


6 


MemAD4 


1 


1 


1 1 


1 


1 


3 


1 2 


3 


early 


72 


3 


MemADS 


1 


1 


2 1 


2 


1 


5 


1 2 


3 


early 


72 


4 


MemADS 


2 


1 


2 1 


3 


1 


6 


1 2 


3 


early 


72 


5 


MemAD7 


2 


2 


2 1 


3 


2 


7 


1 3 


4 


early 


72 


6 


MemAD8 


? 


1 


1 1 


1 


1 


30 


1 2 


3 


early 


t 


3 


MemADS 


1 


1 


2 1 


2 


1 


30 


2 5 


9 


early 


t 


4 


MemADIO 


2 


2 


2 2 


4 


2 


30 


2 3 


8 


late 


72 


7 


MemADII 


3 


3 


3 3 


3 


3 


30 


2 4 


13 


late 


72 


9 


MemADl2 


1 


1 


2 1 


2 


1 


4 


1 2 


3 


early 


72 


4 


MemADia 


2 


1 


2 1 


2 


2 


5 


1 2 


3 


early 


72 


5 


MemAD14 


2 


2 


2 1 


3 


2 


6 


1 3 


4 


early 


72 


6 


MemADIS 


2 


1 


2 3 


3 


3 


8 


1 2 


3 


early 


72 


7 


MemADSI 


4 


4 


4 4 


4 


4 


31 


30 30 


18 


late 


72 


12 



t Provided for static RAM only. 
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Figure 8.17 IMS T805 internal configuration 
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4 IMS T805 engineering data 



Internal configuration 



External configuration 




IVIemnotWrDO 
IVIemnotRfDI 
l\/lemAD2 
IVIemADS 

i 

IVIemADSI 
IVIemConfig © 
IVIemConfig @ 



© Internal configuration: IVIemConfig connected to MemAD2 

@ External configuration: MemConfig connected to inverse of MemAD3 



Figure 8.18 IMS T805 internal configuration scan 

8.7.2 External configuration 

If MemConfig is held low until MemnotWrDO goes low the internal configuration is ignored and an external 
configuration will be loaded instead. An external configuration scan always follows an internal one, but if an 
internal configuration occurs any external configuration is ignored. 

The external configuration scan comprises 36 successive external read cycles, using the default EMI con- 
figuration preset by MemAD31. However, instead of data being read on the data bus as for a normal read 
cycle, only a single bit of data is read on MemConfig at each cycle. Addresses put out on the bus for each 
read cycle are shown In table 8.10, and are designed to address ROM at the top of the memory map. The 
table shows the data to be held in ROM; data required at the MemConfig pin is the inverse of this. 

MemConfig is typically connected via an inverter to MemnotWrDO. Data bit zero of the least significant byte 
of each ROM word then provides the configuration data stream. By switching MemConfig between various 
data bus lines up to 32 configurations can be stored in ROM, one per bit of the data bus. MemConfig can be 
permanently connected to a data line or to GND. Connecting MemConfig to GND gives all Tstates configured 
to four periods; notMemSl pulse of maximum duration; notMemS2-4 delayed by maximum; refresh interval 
72 periods of Clockln; refresh enabled; late write. 

The external memory configuration table 8.10 shows the contribution of each memory address to the 13 con- 
figuration fields. The lowest 12 words (#7FFFFF6C to #7FFFFF98, fields 1 to 6) define the number of extra 
periods Tm to be added to each Tstate. If field 2 is 3 then three extra periods will be added to T2 to extend 
it to the maximum of four periods. 

The next five addresses (field 7) define the duration of notMemSl and the following fifteen (fields 8 to 1 0) 
define the delays before strobes notMemS2-4 become active. The five bits allocated to each strobe allow 
durations of from to 31 periods Tm, as described in strobes page 80. 

Addresses #7FFFFFEC to #7FFFFFF4 (fields 1 1 and 12) define the refresh interval and whether refresh is to 
be used, whilst the final address (field 13) supplies a high bit to MemConfig if a late write cycle is required. 



The columns to the right of the coding table show the values of each configuration bit for the four sample 



8 External memory interface 



99 



external configuration diagrams. Note tiie inclusion of period E at the end of T6 in some diagrams. This is 
inserted to bring the start of the next Tstate T1 to coincide with a rising edge of ProcClockOut (page 82). 

Wait states W have been added to show the effect of them on strobe timing; they are not part of a configuration. 
In each case which includes wait states, two wait periods are defined. This shows that if a wait state would 
cause the start of T5 to coincide with a falling edge of ProcClockOut, another period Tm is generated by 
the EMI to force it to coincide with a rising edge of ProcClockOut. This coincidence Is only necessary if wait 
states are added, otherwise coincidence with a falling edge is permitted. Any configuration memory access 
is only permitted to be extended using wait, up to a total of 14 Clockln periods. 
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Figure 8.19 IMS T805 external configuration 
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4 IMS T805 engineering data 



Internal configuration 



Address 



External configuration 
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' IVIemConfig connected to inverse of MemnotWrDO 

I Configuration field 1 ; T1 configured for 2 periods Tm 

' Configuration field 2; T2 configured for 3 periods Tm 

I Configuration field 10; most significant bit of notMemS4 configured high 

I Configuration field 11 ; refresh interval configured for 36 periods Clockln 

' Configuration field 12; refresh enabled 

' Configuration field 13; early write cycle 



Figure 8.20 IMS T805 external configuration scan 
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Table 8.10 IMS T805 external configuration coding 



Scan 
cycle 


MemAD 
address 


Field 


Function 


Example diagram 


1 


2 


3 


4 


1 


7FFFFF6C 


1 


T1 least significant bit 














2 


7FFFFF70 


1 


T1 most significant bit 














3 


7FFFFF74 


2 


T2 least significant bit 


1 








1 


4 


7FFFFF78 


2 


T2 most significant bit 














5 


7FFFFF7C 


3 


T3 least significant bit 


1 


1 


1 


1 


6 


7FFFFF80 


3 


T3 most significant bit 














7 


7FFFFF84 


4 


T4 least significant bit 














8 


7FFFFF88 


4 


T4 most significant bit 














9 


7FFFFF8C 


5 


T5 least significant bit 














10 


7FFFFF90 


5 


T5 most significant bit 














11 


7FFFFF94 


6 


T6 least significant bit 


1 





1 


1 


12 


/l-hFFF98 


6 


T6 most significant bit 














13 


7FFFFF9C 


7 


notMemSI least significant bit 








1 


1 


14 


7FFFFFA0 


7 
















15 


7FFFFFA4 


7 


^ ^ 














16 


7FFFFFA8 


7 




1 











17 


7FFFFFAC 


7 


notMemSI most significant bit 














18 


7FFFFFB0 


8 


notMemS2 least significant bit 


1 








1 


19 


7FFFFFB4 


8 




1 


1 





1 


20 


7FFFFFB8 


8 


^ ^ 











1 


21 


7FFFFFBC 


8 
















22 


7FFFFFC0 


8 


notMemS2 most significant bit 














23 


7FFFFFC4 


9 


notMemS3 least significant bit 


1 


1 


1 


1 


24 


7FFFFFC8 


9 







1 








25 


7FFFFFCC 


9 


^ ^ 





1 





1 


26 


7FFFFFD0 


9 










1 





27 


7FFFFFD4 


9 


notMemS3 most significant bit 














28 


7FFFFFD8 


10 


notMemS4 least significant bit 











1 


29 


7FFFFFDC 


10 







1 


1 


1 


30 


7FFFFFE0 


10 


^ ^ 


1 


1 








31 


7FFFFFE4 


10 
















32 


7FFFFFE8 


10 


notMemS4 most significant bit 














33 


7FFFFFEC 


11 


Refresh Interval least significant bit 


- 


- 


- 


- 


34 


7FFFFFF0 


11 


Refresh Interval most significant bit 


- 


- 


- 


- 


35 


7FFFFFF4 


12 


Refresh Enable 


- 


- 


- 


- 


36 


7FFFFFF8 


13 


Late Write 





1 


1 
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4 IMS T805 engineering data 



Table 8.1 1 IMS T805 memory refresh configuration coding 



Refresli 


Interval 


Field 11 


Complete 


Interval 


in fis 


encoding 


cycle (ms) 


18 


3.6 


00 


0.922 


36 


7.2 


01 


1.843 


54 


10.8 


10 


2.765 


72 


14.4 


11 


3.686 



Refresh intervals are In periods of Clockln and Clockln frequency is 5 MHz: 

Interval = 18 * 200 = 3600 ns 

Refresh interval Is between successive Incremental refresh addresses. 
Complete cycles are shown for 256 row DRAMS. 

Table 8.12 Memory configuration 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TMCVRdH 
TRdHMCX 
TSOLRdH 


Memory configuration data setup 
Memory configuration data hold 
notMemSO to configuration data read 


25 



a-12 




a+12 


ns 
ns 
ns 


1 



Notes 



1 a is 1 6 periods Tm. 
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T3 T4 T5 1 T6 i T1 
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III III 1 1 1 1 1 1 1 1 1 1 1 1 1 
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^«««««« Data >» 
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— ««««««k Data »> 



Figure 8.21 IMS T805 external configuration read cycle timing 
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9 Events 

EventReq and EventAck provide an asynchronous handshake interface between an external event and an 
internal process. When an external event takes EventReq high the external event channel (additional to the 
external link channels) is made ready to communicate with a process. When both the event channel and the 
process are ready the processor takes EventAck high and the process, if waiting, is scheduled. EventAck 
is removed after EventReq goes low. 

EventWaiting is asserted high by the transputer when a process executes an input on the event channel; 
typically with the occam event ? ANY instruction. It remains high whilst the transputer is waiting for or 
servicing EventReq and is returned low when EventAck goes high. The EventWaiting pin changes near the 
falling edge of ProcClockOut and can therefore be sampled by the rising edge of ProcClockOut. 

The EventWaiting pin can only be asserted by executing an in instruction on the event channel. The 
EventWaiting pin is not asserted high when an enable channel (enbc) instruction is executed on the Event 
channel (during an ALT construct in occam, for example). The EventWaiting pin can be asserted by executing 
the occam input on the event channel (such as Event ? ANY), provided that this does not occur as a 
guard in an alternative process. The EventWaiting pin can not be used to signify that an alternative process 
(ALT) is waiting on an input from the event channel. 

EventWaiting allows a process to control external logic; for example, to clock a number of inputs into a 
memory mapped data latch so that the event request type can be determined. 

Only one process may use the event channel at any given time. If no process requires an event to occur 
EventAck will never be taken high. Although EventReq triggers the channel on a transition from low to high, 
it must not be removed before EventAck is high. EventReq should be low during Reset; if not it will be 
ignored until it has gone low and returned high. EventAck is taken low when Reset occurs. 

If the process is a high priority one and no other high priority process is running, the latency is as described 
on page 56. Setting a high priority task to wait for an event input allows the user to interrupt a transputer 
program running at low priority. The time taken from asserting EventReq to the execution of the microcode 
interrupt handler in the CPU is four cycles. The following functions take place during the four cycles: 

Cycle 1 Sample EventReq at pad on the rising edge of ProcClockOut and synchronise. 

Cycle 2 Edge detect the synchronised EventReq and form the interrupt request. 

Cycle 3 Sample interrupt vector for microcode ROM in the CPU. 

Cycle 4 Execute the interrupt routine for Event rather than the next instruction. 
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4 IMS T805 engineering data 



Table 9.1 Event 



SYIVIBOL 


PARAIVIETER 


MIN 


NOIVI 


MAX 


UNITS 


NOTE 


TVHKH 

TKHVL 

TVLKL 

TKLVH 

TKHEWL 

TKLEWH 


Event request response 

Event request hold 

Delay before removal of event acknowledge 

Delay before re-assertion of event request 

Event acknowledge to end of event waiting 

End of event acknowledge to event waiting 












6Tm+7ns 


ns 
ns 

ns 
ns 
ns 













EventReq / 
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A 
-^ TVLKL 

^ TKLVH ^ 


/ 


TVHKH -*J 
EventAck / 


^ TKHVL ^ 
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/^ 


^ TKLEWH ^ 




EventWaitIng j \ 

Process waiting for Event 




>T\- 

Event waiting for Process 



Figure 9.1 IMS T805 event timing 
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10 



Links 



Four identical INMOS bi-directional serial links provide synchronized communication between processors 
and with the outside world. Each link comprises an input channel and output channel. A link between two 
transputers is implemented by connecting a link interface on one transputer to a link interface on the other 
transputer. Every byte of data sent on a link is acknowledged on the input of the same link, thus each signal 
line carries both data and control information. 

The quiescent state of a link output is low. Each data byte is transmitted as a high start bit followed by a one 
bit followed by eight data bits followed by a low stop bit. The least significant bit of data is transmitted first. 
After transmitting a data byte the sender waits for the acknowledge, which consists of a high start bit followed 
by a zero bit. The acknowledge signifies both that a process was able to receive the acknowledged data byte 
and that the receiving link is able to receive another byte. The sending link reschedules the sending process 
only after the acknowledge for the final byte of the message has been received. 

The IMS T805 links support the standard INMOS communication speed of 10 Mbits/sec. In addition they can 
be used at 5 or 20 Mbits/sec for 17 MHz, 20 MHz, and 25 MHz devices, and 20 Mbits/sec for faster devices. 
Links are not synchronised with Clockln or ProcClockOut and are insensitive to their phases. Thus links 
from independently clocked systems may communicate, providing only that the clocks are nominally identical 
and within specification. 

Links are TTL compatible and intended to be used in electrically quiet environments, between devices on a 
single printed circuit board or between two boards via a backplane. Direct connection may be made between 
devices separated by a distance of less than 300 mlHimetres. For longer distances a matched 100 ohm 
transmission line should be used with series matching resistors RM. When this is done the line delay should 
be less than 0.4 bit time to ensure that the reflection returns before the next data bit is sent. 

Buffers may be used for very long transmissions. If so, their overall propagation delay should be stable within 
the skew tolerance of the link, although the absolute value of the delay is immaterial. 

Link speeds can be set by LinkSpecial, LinkOSpecial and Link123Special. The link speed can be set 
independently. Table 10.1 shows uni-directional and bi-directional data rates in Kbytes/sec for each link 
speed; LinknSpecial is to be read as LinkOSpecial when selecting link speed and as Link123Special for 
the others. Data rates are quoted for a transputer using internal memory, and will be affected by a factor 
depending on the number of external memory accesses and the length of the external memory cycle. 

Table 10.1 Speed Settings for Transputer Links 



Link 
Special 


Linkn 
Special 


Mbits/sec 


Kbytes/sec 


Notes 


Uni 


Bi 





1 
1 




1 


1 


10 
5 
10 
20 


910 

450 

910 

1740 


1250 

670 

1250 

2350 


1 



Notes 



1 This setting is reserved for IMS T805-30 and faster devices. 





H L, 
1 Ack 1 


H'H 01234567 L, 


1 Data 1 



Figure 10.1 IMS T805 link data and acknowledge packets 
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4 IMS T805 engineering data 



Table 10.2 Link 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TJQr 


LinkOut rise time 






20 


ns 


1 


TJQf 


LinkOut fall time 






10 


ns 


1 


TJDr 


Linkin rise time 






20 


ns 


1 


TJDf 


Linkin fall time 






20 


ns 


1 


TJQJD 


Buffered edge delay 









ns 




TJBskew 


Variation in TJQJD 20 Mbits/s 






3 


ns 


2 




10 Mbits/s 






10 


ns 


2 




5 Mbits/s 






30 


ns 


2 


CLIZ 


Linkin capacitance @ f=1 MHz 






7 


PF 


1 


CLL 


LinkOut load capacitance 






50 


PF 




RM 


Series resistor for lOOn transmission line 




56 




ohms 





Notes 



1 These parameters are sampled, but not 100% tested. 

2 This is the variation in the total delay through buffers, transmission lines, differential receivers etc., caused by 
such things as short term variation in supply voltages and differences in delays for rising and falling edges. 




Figure 10.2 IMS T805 link timing 
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Figure 10.3 IMS T805 buffered link timing 
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Figure 10.4 IMS T805 Links directly connected 
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Figure 10.5 IMS T805 Links connected by transmission line 
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Figure 10.6 IMS T805 Links connected by buffers 
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11 Electrical specifications 

11.1 DC electrical characteristics 



Table 11.1 Absolute maximum ratings 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


vcc 


DC supply voltage 





7.0 


V 


1,2,3 


VI, vo 


Voltage on input and output pins 


-0.5 


VCC+0.5 


V 


1,2,3 


II 


Input current 




±25 


mA 


4 


OSCT 


Output short circuit time (one pin) 




1 


s 


2 


TS 


Storage temperature 


-65 


150 


°C 


2 


TA 


Ambient temperature under bias 


-55 


125 


°C 


2 


PDmax 


Maximum allowable dissipation 




2 


W 





Notes 



1 All voltages are with respect to GND. 

2 This is a stress rating only and functional operation of the device at these or any other conditions beyond those 
indicated in the operating sections of this specification is not implied. Stresses greater than those listed may 
cause permanent damage to the device. Exposure to absolute maximum rating conditions for extended periods 
may affect reliability. 

3 This device contains circuitry to protect the inputs against damage caused by high static voltages or electrical 
fields. However, it is advised that normal precautions be taken to avoid application of any voltage higher than the 
absolute maximum rated voltages to this high impedance circuit. Unused inputs should be tied to an appropriate 
logic level such as VCC or GND. 

4 The input current applies to any input or output pin and applies when the voltage on the pin is between GND 
and vcc. 



Table 1 1 .2 Operating conditions 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VCC 


DC supply voltage 


4.75 


5.25 


V 


1 


VI, VO 


Input or output voltage 





VCC 


V 


1,2 


CL 


Load capacitance on any pin 




60 


PF 




TA 


Operating temperature range IMS T805-S 





70 


°C 


3 


TA 


Operating temperature range IMS T805-M 


-55 


125 


°C 


3 



Notes 



1 All voltages are with respect to GND. 

2 Excursions beyond the supplies are permitted but not recommended; see DC characteristics. 

3 Air flow rate 400 linear ft/min transverse air flow. 



11 Electrical specifications 



109 



Table 11 .3 DC characteristics 



SYIVIBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VIH 


High level input voltage 


2.0 


VCC+0.5 


V 


1,2 


VIL 


Low level input voltage 


-0.5 


0.8 


V 


1.2 


II 


Input current @ GND<VI<VCC 




±10 


mA 


1,2 


VOH 


Output high voltage @ IOH=2mA 


VCC-1 




V 


1,2 


VOL 


Output low voltage @ IOL=4mA 




0.4 


V 


1,2 


lOS 


Output short circuit current @ GND<VO<VCC 


36 


65 


mA 


1,2,3,6 






65 


100 


mA 


1,2,4,6 


lOZ 


Tristate output current @ GND<VO<VCC 




±10 


/iA 


1,2 


PD 


Power dissipation 




1.2 


W 


2,5 


CIN 


Input capacitance @ f=:l MHz 




7 


PF 


6 


COZ 


Output capacitance @ f=l MHz 




10 


PF 


6 



Notes 



1 All voltages are with respect to GND. 

2 Parameters for IMS T805-S measured at 4.75V<VCC<5.25V and 0°C<TA<70°C. 
Input clock frequency = 5 MHz. 

3 Current sourced from non-link outputs. 

4 Current sourced from link outputs. 

5 Power dissipation varies with output loading and program execution. 
Power dissipation for processor operating at 20 MHz. 

6 This parameter is sampled and not 100% tested. 



11.2 Equivalent circuits 



VCC —r— 

RIM 

niitniit r - 1-^ 1 










Load for: 


R1 


R2 


Equivalent load: 




Link outputs 
Other outputs 


1K96 
970R 


47K 
24K 


1 Schottky TTL input 

2 Schottky TTL inputs 


50pF = 


l'^ T 


Diodes are 1N916 



















Figure 11.1 Load circuit for AG measurements 
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Inputs VIH 
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Figure 11.2 AC measurements timing waveforms 
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Figure 11.3 Tristate load circuit for AC measurements 
11.3 AC timing characteristics 

Table 1 1 .4 Input, output edges 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


TDr 


Input rising edges 


2 


20 


ns 


1,2.4 


TDf 


Input falling edges 


2 


20 


ns 


1,2,4 


TQr 


Output rising edges 




25 


ns 


1,5 


TQf 


Output falling edges 




15 


ns 


1.5 


TSOLaHZ 


Address high to tristate 


a 


a+6 


ns 


3 


TSOLaLZ 


Address low to tristate 


a 


a+6 


ns 


3 



Notes 



1 Non-link pins; see section on links. 

2 All inputs except Clockln; see section on Clockln. 

3 a is T2 where T2 can be from one to four periods Tm in length. 
Address lines include MemnotWrDO, MemnotRfDI, MemAD2-31. 

4 These parameters are not tested. 

5 These parameters are sampled, but not 100% tested. 
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Figure 11.4 IMS T805 input and output edge timing 
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Figure 11.5 IMS T805 tristate timing relative to notMemSO 
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Figure 1 1 .6 Typical rise/fall times 



Notes 



1 Skew is measured between notMemSO with a standard load (2 Schottky TTL inputs and 30 pF) and notMemSO 
with a load of 2 Schottky TTL inputs and varying capacitance. 
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11.4 Power rating 

Internal power dissipation Pint of transputer and peripheral chips depends on VCC, as shown in figure 1 1 .7. 
Pint is substantially independent of temperature. 

Total power dissipation Pd of the chip is 

Pd = Pint + Pio 

where Pio is the power dissipation in the input and output pins; this is application dependent. 

Internal working temperature Tj of the chip is 

Tj^Ta-^OJa^Pd 

where Ta is the external ambient temperature in °C and BJa is the junction-to-ambient thermal resistance in 
°C/W. 6Ja for each package is given in the Packaging Specifications section. 
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700- 

Power 

PINT 600 - 

mW 

500- 




T805-30 (projected) 

T805-25 
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T805-1 7 



~] — I — I — \ — I — I — \ — I — I — I — \ — \ — I — r 
4.4 4.6 4.8 5.0 5.2 5.4 5.6 
VCC Volts 



Figure 1 1 .7 IMS T805 internal power dissipation vs VCC 
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Figure 11.8 IMS T805 typical power dissipation with processor speed 
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12 



Performance 



The performance of the transputer is measured in terms of the number of bytes required for the program, and 
the number of (internal) processor cycles required to execute the program. The figures here relate to Occam 
programs. For the same function, other languages should achieve approximately the same performance as 
Occam. 

With transputers incorporating an FPU, this type of performance calculation is straight forward when consider- 
ing only integer data types. However, when floating point calculations using the REAL32 and REAL64 data 
types are present in the program, complications arise due to the concurrency inherent in the transputer's de- 
sign whereby integer calculations can be overlapped with floating point calculations. A more comprehensive 
guide to the impact of this concurrency on transputer performance can be found in the Transputer Instruction 
Set - A Compiler Writers' Guide. 



12.1 



Performance overview 



These figures are averages obtained from detailed simulation, and should be used only as an initial guide; 
they assume operands are of type INT. The abbreviations in table 12.1 are used to represent the quantities 
indicated. In the replicator section of the table, figures in braces {} are not necessary if the number of 
replications is a compile time constant. To estimate performance, add together the time for the variable 
references and the time for the operation. 



Table 12.1 Key to performance table 



np 

ne 

r 

ts 

w 

P 

Eg 

Et 

Tb 

Tbp 

Tbc 

nsp 

nap 



number of component processes 

number of processes earlier in queue 

1 if INT parameter or array parameter, if not 

number of table entries (table size) 

width of constant in nibbles 

number of places to shift 

expression used in a guard 

timer expression used in a guard 

most significant bit set of multiplier ((-1) if the multiplier is 0) 

most significant bit set in a positive multiplier when counting from zero ((-1) if the multiplier is 0) 

most significant bit set in the two's complement of a negative multiplier 

Number of scalar parameters in a procedure 

Number of array parameters in a procedure 
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Table 12.2 Performance 





Size (bytes) 


Time (cycles) 


Names 

variables 

in expression 
assigned to or input to 
in PROC or FUNCTION call, 
corresponding to an INT parameter 

channels 


1.1+r 
1.1 +r 

1.1+r 
1.1 


2.1+2(r) 
1.1 +(r) 

1.1 +(r) 
2.1 


Array Variables (for single dimension arrays) 
constant subscript 
variable subscript 
expression subscript 




5.3 
5.3 




7.3 
7.3 


Declarations 

CHAN OF protocol 
[size] CHAN OF protocol 
PROC 


3.1 

9.4 

body+2 


3.1 

2.2 + 20.2*size 




Primitives 

assignment 

input 

output 

STOP 

SKIP 



4 
1 
2 





26.5 

26 

25 




Arithmetic operators 
+ 
* 

/ 

REM 

» « 


1 
2 
2 
2 
2 


1 

39 
40 
38 
3+p 


Modulo Arithmetic operators 
PLUS 
MINUS 

TIMES (fast multiply, positive operand) 
TIMES (fast multiply, negative operand) 


2 

1 
1 
1 


2 

1 
4+Tbp 
5+Tbc 


Boolean operators 

OR 

AND NOT 


4 

1 


8 
2 


Comparison operators 
= constant 
= variable 
<> constant 
<> variable 
> < 
>= <= 



2 
1 
3 
1 
2 


1 
3 
3 
5 
2 
4 


Bit operators 

A \/ X - 


2 


2 


Expressions 

constant in expression 
check if error 


w 

4 


w 

6 
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Table 12.3 Performance 





Size (bytes) 


Time (cycles) 


Timers 






timer input 


2 


3 


timer AFTER 






If past time 


2 


4 


with empty timer queue 


2 


31 


non-empty timer queue 


2 


38+ne*9 


ALT (timer) 






with empty timer queue 


6 


52 


non-empty timer queue 


6 


59+ne*9 


timer alt guard 


8+2Eg4-2Et 


34+2Eg+2Et 


Constructs 






SEQ 








IF 


1.3 


1.4 


if guard 


3 


4.3 


ALT (non timer) 


6 


26 


alt channel guard 


10.2+2Eg 


20+2Eg 


skip alt guard 


8+2Eg 


10+2Eg 


PAR 


11.5+(np-1)*7.5 


19.5+(np-1)*30.5 


WHILE 


4 


12 


Procedure or function call 








3.5+(nsp-2)*1.1 


16.5+(nsp-2)*1.1 




+nap*2.3 


+nap*2.3 


Replicators 






replicated SEQ 


7.3{+5.1} 


(-3.8)+15.1*count{+7.1} 


replicated IF 


12.3{+5.1} 


(-2.6)+19.4*count{+7.1} 


replicated ALT 


24.8{+10.2} 


25.4+33.4*count{+14.2} 


replicated timer ALT 


24.8{+10.2} 


62.4+33.4*count{+14.2} 


replicated PAR 


39.1 {+5.1} 


(-6.4)+70.9*count{+7.1} 



12.2 Fast multiply, TIMES 

The IMS T805 has a fast Integer multiplication Instruction product. For a positive multiplier its execution time 
Is 4+Tbp cycles, and for a negative multiplier 5+Tbc cycles (table 12.1). The time taken for a multiplication 
by zero Is 3 cycles. 

Implementations of high level languages on the transputer may take advantage of this instruction. For example, 
the Occam modulo arithmetic operator TIMES is implemented by the instruction and the right-hand operand is 
treated as the multiplier. The fast multiplication instruction is also used in high level language implementations 
for the multiplication implicit in multi-dimensional array access. 
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12.3 



Arithmetic 



A set of functions are provided witiiin the development system to support the efficient implementation of 
multiple length integer arithmetic. In the IMS T805, floating point arithmetic is taken care of by the FPU. In 
table 12.4 n gives the number of places shifted and all arguments and results are assumed to be local. Full 
details of these functions are provided in the Occam reference manual, supplied as part of the development 
system and available as a separate publication. 

When calculating the execution time of the predefined maths functions, no time needs to be added for calling 
overhead. These functions are compiled directly into special purpose instructions which are designed to 
support the efficient implementation of multiple length integer arithmetic and floating point arithmetic. 

Table 12.4 Arithmetic performance 



Function 


Cycles 


+ cycles for 
parameter access f 


LONGADD 




2 


7 


LONGSUM 




3 


8 


LONGSUB 




2 


7 


LONGDIFF 




3 


8 


LONGPROD 




34 


8 


LONGDIV 




36 


8 


SHIFTRIGHT 


(n<32) 


4+n 


8 


SHIFTLEFT 


(n>=32) 
(n<32) 


n-27 
4+n 


8 


NORMALISE 
ASHIFTRIGHT 


(n>=32) 
(n<32) 
(n>=32) 
(n=64) 


n-27 
n+6 
n-25 

4 
SHIFTRIGHT+2 


7 
5 


ASHIFTLEFT 




SHIFTLEFT+4 


5 


ROTATERIGHT 




SHIFTRIGHT 


7 


ROTATELEFT 




SHIFTLEFT 


7 


FRACMUL 




LONGPROD+4 


5 



t Assuming local variables. 
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12.4 Floating point operations 

All references to REAL32 or REAL64 operands within programs compiled for the IMS T805 normally produce 
the following performance figures. 

Table 12.5 Floating point performance 





Size (bytes) 


REAL32 Time (cycles) 


REAL64 Time (cycles) 


Names 








variables 








in expression 


3.1 


3 


5 


assigned to or input to 


3.1 


3 


5 


in PROC or FUNCTION call, 








corresponding to a REAL 








parameter 


1.1+r 


1.1+r 


1.1+r 


Aritlimetic operators 








+ 


2 


7 


7 


• 


2 


11 


20 


/ 


2 


17 


32 


REM 


11 


19 


34 


Comparison operators 








= 


2 


4 


4 


<> 


3 


6 


6 


> < 


2 


5 


5 


>= <= 


3 


7 


7 


Conversions 








REALS 2 to - 


2 




3 


REAL64 to - 


2 


6 




To INT32 from - 


5 


9 


9 


To INT64 from - 


18 


32 


32 


INT32 to - 


3 


7 


7 


INT64 to - 


14 


24 


22 



12.4.1 Floating point functions 

These functions are provided by the development system. They are compiled directly into special purpose 
instructions designed to support the efficient implementation of some of the common mathematical functions 
of other languages. The functions provide ABS and SQRT for both REAL32 and REAL64 operand types. 

Table 12.6 IMS T805 floating point arithmetic performance 



Function 


Cycles 


+ cycles for parameter access t 


REAL32 


REAL64 


ABS 


2 


8 




SQRT 


118 


8 




DABS 


2 




12 


DSQRT 


244 




12 



t Assuming local variables. 
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12.4.2 Special purpose functions and procedures 

The functions and procedures given in tables 12.8 and 12.9 are provided by the development system to give 
access to the special instructions available on the IMS T805. Table 12.7 shows the key to the table. 



Table 12.7 Key to special performance table 



Tb 
n 
r 
nr 



most significant bit set in the word counting from zero 
number of words per row (consecutive memory locations) 
number of rows in the two dimensional move 
number of bits to reverse 



Table 12.8 Special purpose functions performance 



Function 


Cycles 


+ cycles for 
parameter access f 


BITCOUNT 


2+Tb 


2 


CRCBYTE 


11 


8 


CRCWORD 


35 


8 


BITREVNBIT 


5+nr 


4 


BITREVWORD 


36 


2 



t Assuming local variables. 



Table 12.9 Special purpose procedures performance 



Procedure 


Cycles 


+ cycles for 
parameter access f 


M0VE2D 
DRAW2D 
CLIP 2D 


8+(2n+23)*r 
8+(2n+23)*r 
8+(2n+23)*r 


8 
8 
8 



t Assuming local variables. 



12.5 Effect of external memory 

Extra processor cycles may be needed when program and/or data are held in external memory, depending 
both on the operation being performed, and on the speed of the external memory. After a processor cycle 
which initiates a write to memory, the processor continues execution at full speed until at least the next 
memory access. 

Whilst a reasonable estimate may be made of the effect of external memory, the actual performance will 
depend upon the exact nature of the given sequence of operations. 

External memory is characterized by the number of extra processor cycles per external memory cycle, denoted 
as e. For the IMS T805, with the fastest external memory the value of e is 2; a typical value for a large external 
memory is 5. 

if a program is stored in external memory, and e has the value 2 or 3, then no extra cycles need be estimated 
for linear code sequences. For larger values of e, the number of extra cycles required for linear code 
sequences may be estimated at (e-3)/4. A transfer of control may be estimated as requiring e+3 cycles. 

These estimates may be refined for various constructs. In table 12.10 n denotes the number of components 
in a construct. In the case of IF, the n'th conditional is the first to evaluate to TRUE, and the costs include the 
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costs of the conditionals tested. The number of bytes in an array assignment or communication is denoted 
by b. 

Table 12.10 External memory performance 





IMS T805 1 


Program off chip 


Data off chip 


Boolean expressions 


e-2 





IF 


3en-8 


en 


Replicated IF 


(6e-4)n+7 


(5e-2)n+8 


Replicated SEQ 


(3e-3)n+2 


(4e-2)n 


PAR 


(3e-1)n+8 


3en+4 


Replicated PAR 


(10e-8)n+8 


16en-12 


ALT 


(2e-4)n+6e 


(2e-2)n+10e-8 


Array assignment and 





max (2e, e(b/2)) 


communication in 






one transputer 







The following simulation results illustrate the effect of storing program and/or data in external memory. The 
results are normalized to 1 for both program and data on chip. The first program (Sieve of Erastosthenes) 
is an extreme case as it is dominated by small, data access intensive loops; it contains no concurrency, 
communication, or even multiplication or division. The second program is the pipeline algorithm for Newton 
Raphson square root computation. 



Table 12.11 IMS T805 external memory performance 





Program 


e=2 


e=3 


e=4 


e=5 


On chip 


Program off chip 


1 
2 


1.3 
1.1 


1.5 
1.2 


1.7 
1.2 


1.9 
1.3 




Data off chip 


1 
2 


1.5 
1.2 


1.8 
1.4 


2.1 
1.6 


2.3 

1.7 




Program and data off chip 


1 
2 


1.8 
1.3 


2.2 
1.6 


2.7 
1.8 


3.2 
2.0 





12.6 Interrupt latency 

If the process is a high priority one and no other high priority process is running, the latency is as described 
in table 12.12. The timings given are in full processor cycles TPCLPCL; the number of Tm states is also 
given where relevant. Maximum latency assumes all memory accesses are internal ones. 



Table 12.12 Interrupt latency 





Typical 


Maximum 


TPCLPCL 


Tm 


TPCLPCL 


Tm 


IMS T805 with FPU in use 
IMS T805 with FPU not in use 


19 
19 


38 
38 


78 
58 


156 
116 
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13 Package specifications 
13.1 84 pin grid array paclcage 
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Figure 13.1 IMS T805 84 pin grid array package pinout 
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Figure 13.2 84 pin grid array package dimensions 



Table 13.1 84 pin grid array package dimensions 



DIM 


Millimetres 


Inches 


Notes 


NOM 


TOL 


NOM 


TOL 


A 


26.924 


±0.254 


1.060 


±0.010 




B 


17.019 


±0.127 


0.670 


±0.005 




C 


2.456 


±0.278 


0.097 


±0.011 




D 


4.572 


±0.127 


0.180 


±0.005 




E 


3.302 


±0.127 


0.130 


±0.005 




F 


0.457 


±0.025 


0.018 


±0.002 


Pin diameter 


G 


1.143 


±0.127 


0.045 


±0.005 


Flange diameter 


K 


22.860 


±0.127 


0.900 


±0.005 




L 


2.540 


±0.127 


0.100 


±0.005 




M 


0.508 




0.020 




Chamfer 



Package weight is approximately 7.2 grams 

Table 13.2 84 pin grid array package junction to ambient thermal resistance 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


^JA 


At 400 linear ft/min transverse air flow 






35 


°c/w 
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13.2 84 pin PLCC J-bend package 
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3 55 MemADS 
3 54 MemAD4 
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Figure 13.3 IMS T805 84 pin PLCC J-bend package pinout 
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Figure 13.4 84 pin PLCC J-bend package dimensions 



Table 13.3 84 pin PLCC J-bend package dimensions 



DllVI 


l\/lillimetres 


Inches 


Notes 


NOIVI 


TOL 


NOIVI 


TOL 


A 


30.226 


±0.127 


1.190 


±0.005 




B 


29.312 


±0.127 


1.154 


±0.005 




C 


3.810 


±0.127 


0.150 


±0.005 




D 


0.508 


±0.127 


0.020 


±0.005 




F 


1.270 


±0.127 


0.050 


±0.005 




G 


0.457 


±0.127 


0.018 


±0.005 




J 


0.000 


±0.051 


0.000 


±0.002 




K 


0.457 


±0.127 


0.018 


±0.005 




L 


0.762 


±0.127 


0.030 


±0.005 





Package weight is approximately 7.0 grams 

Table 13.4 84 pin PLCC J-bend package junction to ambient thermal resistance 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


^JA 


At 400 linear ft/min transverse air flow 




35 




°c/w 
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4 IMS T805 engineering data 



13.3 84 lead quad cerpack package 

The leads are unformed to allow the user to form them to specific requirements. 



CapMlnus 75 c 

VCC76C 

ProcSpeedSelectO 77 c 

GND78C: 

Errorin 79 c 

ProcSpeedSelect2 80 c 

Error 81 c 

BootFromROM 82 c 

Reset 83 c 

DisablelntRAM84c: 



ProcSpeedSelectI 
Analyse 
MemAD31 
MemAD30 
MemAD29 
GND 
MemAD28 
MemAD27 
MemAD26 



ic 
2n 
3c 
4c: 
5c 
6c 
7n 
8c 
9c 



MemAD25 10 c 
MemAD24 1 1 c 



c 
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- *" 
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IMS T805 
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D53 MemReq 
d52 Mem Granted 
d51 Mem Wait 
=150 notMemRf 
D 49 notMemWrB3 
D 48 notMemWrB2 
D 47 notMemWrBI 
3 46 notMemWrBO 
D 45 notMemRd 
n 44 notMemSO 



3 43 VCC 

D 42 notMemS4 

d41 notMemS3 

3 40 notMemS2 

=139 notMemSI 

D 38 GND 

d37 MemnotWrDO 

=136 MemnotRfDI 

d35 MemAD2 

=3 34MemAD3 

3 33 Mem AD4 



Figure 13.5 IMS T805 84 lead quad cerpack package pinout 
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Figure 13.6 84 lead quad cerpack package dimensions 



DIM 


Millimetres 


Inches 


Notes 


NOM 


TOL 


NOM 


TOL 


A 


38.100 


±0.508 


1.500 


±0.020 




B 


26.924 


±0.305 


1.060 


±0.012 




C 


20.574 


±0.203 


0.810 


±0.008 




D 


19.558 


±0.254 


0.770 


±0.010 




E 


0.508 




0.020 






F 


1.270 


±0.051 


0.050 


±0.002 




G 


2.489 


±0.305 


0.098 


±0.012 




H 


0.635 


±0.076 


0.025 


±0.003 




J 


1.143 


±0.102 


0.045 


±0.004 




K 


3.099 




0.122 




Max. 


L 


27.940 




1.100 




Max. 


M 


0.178 


±0.025 


0.007 


±0.001 





Table 13.5 84 lead quad cerpack package dimensions 
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14 Ordering 

This section indicates tlie designation of speed and pacl<age selections for the various devices. Speed of 
Clockln is 5 MHz for all parts. Transputer processor cycle time is nominal; it can be calculated more exactly 
using the phase lock loop factor PLLx, as detailed in the external memory section. 

For availability contact local INMOS sales office or authorised distributor. 

Table 14.1 IMS T805 ordering details 



INMOS 
designation 


Processor 
clock speed 


Processor 
cycle time 


PLLx 


Package 


IMS T805.G17S 
IMS T805-G20S 
IMS T805-G25S 
IMS T805-G30S 


17.5 MHz 
20.0 MHz 
25.0 MHz 
30.0 MHz 


57 ns 
50 ns 
40 ns 
33 ns 


3.5 
4.0 
5.0 
6.0 


Ceramic Pin Grid 
Ceramic Pin Grid 
Ceramic Pin Grid 
Ceramic Pin Grid 


IMS T805-J17S 
IMS T805-J20S 


17.5 MHz 
20.0 MHz 


57 ns 
50 ns 


3.5 
4.0 


Plastic PLCC J-Bend 
Plastic PLCC J-Bend 


IMS T805-G17M 
IMS T805-G20M 


17.5 MHz 
20.0 MHz 


57 ns 
50 ns 


3.5 
4.0 


Ceramic Pin Grid MIL Spec 
Ceramic Pin Grid MIL Spec 


IMS T805-Q17M 
IMS T805-Q20M 


17.5 MHz 
20.0 MHz 


57 ns 
50 ns 


3.5 
4.0 


Quad Cerpack MIL Spec 
Quad Cerpack MIL Spec 



The timing parameters in this datasheet are based on 17 MHz and 20 MHz parts. Data for higher speeds is 
based on tests on a limited number of samples and may change when full characterisation is completed. 
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IMS T801 

transputer 



QODmos 



Preliminary Data 



FEATURES 

32 bit architecture 

33 ns internal cycle time 

30 MIPS (peak) instruction rate 

4.3 Mflops (peak) instruction rate 

Debugging support 

64 bit on-chip floating point unit which conforms to 

IEEE 754 

4 Kbytes on-chip static RAM 

120 Mbytes/sec sustained data rate to internal memory 

4 Gbytes directly addressable external memory 

60 Mbytes/sec sustained data rate to external memory 

630 ns response to interrupts 

Four INMOS serial links 10/20 Mbits/sec 

Bi-directional data rate of 2.4 Mbytes/sec per link 

High performance graphics support with block move 

instructions 

Boot from ROM or communication links 

Single 5 MHz clock Input 

Single +5V ±5% power supply 

MIL-STD-883C processing will be available 

APPLICATIONS 

Scientific and mathematical applications 

High speed multi processor systems 

High performance graphics processing 

Supercomputers 

Workstations and workstation clusters 

Digital signal processing 

Accelerator processors 

Distributed databases 

System simulation 

Telecommunications 

Robotics 

Fault tolerant systems 

Image processing 

Pattern recognition 

Artificial intelligence 
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1 Introduction 

The IMS T801 transputer is a 32 bit CMOS microcomputer with a 64 bit floating point unit and graphics support. 
It has 4 Kbytes on-chip RAM for high speed processing, a 32 bit non-multiplexed external memory Interface 
and four standard INMOS communication links. The instruction set achieves efficient implementation of high 
level languages and provides direct support for the Occam model of concurrency when using either a single 
transputer or a network. Procedure calls, process switching and typical Interrupt latency are sub-microsecond. 

For convenience of description, the IMS T801 operation is split into the basic blocks shown in figure 1.1. 



vcc 

GND 

CapPlus 

CapMinus 

Reset 

Analyse 

ErrorOut 

BootFromROM 

Clockin 

ProcSpeedSefectO-2 



ProcClockOut - 
notMemWrBO-3 - 



notMemCE -*- 



MemWalt ■ 
MemReq - 
MemG ranted 



Floating Point Unit 





System 
services 
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"* 
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32 
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32 
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V Interface 



32 



Link 



V Interface 



Event 



LinkSpeed 

LinkInO 
LinkOutO 

Linkim 
LinkOutI 

Linkln2 
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LinklnS 
' LinkOutS 

Event Req 



32 



-*- EventAck 
-*- EventWaiting 

~N MemDO-31 



30 



^ 



MemA2-31 



Figure 1.1 IMS T801 block diagram 

The processor speed of a device can be pin-selected in stages from 17.5 MHz up to the maximum allowed 
for the part. A device running at 30 MHz achieves an instruction throughput of 30 MIPS peak and 15 MIPS 
sustained. The extended temperature version of the device complies with MIL-STD-883C. 

The IMS T801 provides high performance arithmetic and floating point operations. The 64 bit floating point unit 
provides single and double length operation to the ANSI-IEEE 754-1985 standard for floating point arithmetic. 
It is able to perform floating point operations concurrently with the processor, sustaining a rate of 2.2 Mflops 
at a processor speed of 20 MHz and 3.3 Mflops at 30 MHz. 
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High performance graphics support is provided by microcoded block move instructions which operate at the 
speed of memory. The two-dimensional block move instructions provide for contiguous block moves as well 
as block copying of either non-zero bytes of data only or zero bytes only. Block move instructions can be used 
to provide graphics operations such as text manipulation, windowing, panning, scrolling and screen updating. 

Cyclic redundancy checking (CRC) instructions are available for use on arbitrary length serial data streams, 
to provide error detection where data integrity is critical. Another feature of the IMS T801 , useful for pattern 
recognition, is the facility to count bits set in a word. 

The IMS T801 can directly access a linear address space of 4 Gbytes. The 32 bit wide memory interface 
uses non-multiplexed data and address lines and provides a data rate of up to 4 bytes every 66 nanoseconds 
(60 Mbytes/sec) for a 30 MHz device. 

System Services include processor reset and bootstrap control, together with facilities for error analysis. 

The standard INMOS communication links allow networks of transputer family products to be constructed by 
direct point to point connections with no external logic. The IMS T801 links support the standard operating 
speed of 10 Mbits/sec, but also operate at 20 Mbits/sec. Each link can transfer data bi-directionally at up to 
2.35 Mbytes/sec. 

The transputer is designed to implement the OCCam language, detailed in the occann Reference Manual, but 
also efficiently supports other languages such as C, Pascal and Fortran. Access to the transputer at machine 
level is seldom required, but if necessary refer to the Transputer Instruction Set - A Compiler Writers' Guide, 
where the IMS T800 instruction set is applicable. 

This data sheet supplies hardware implementation and characterisation details for the IMS T801 . It is intended 
to be read in conjunction with the Transputer Architecture chapter, which details the architecture of the 
transputer and gives an overview of Occam. 

The IMS T801 instruction set contains a number of instructions to facilitate the implementation of breakpoints. 
For further information concerning breakpointing, refer to Support for debugging/breakpointing in transputers 
(technical note 61). 

Figure 1 .2 shows the internal datapaths for the IMS T801 . 
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channel address 



data address 



Figure 1.2 IMS T801 internal datapaths 
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Pin designations 



Table 2.1 IMS T801 system services 



Pin 


In/Out 


Function 


VCC, GND 




Power supply and return 


CapPlus, CapMinus 




External capacitor for internal clock power supply 


Clockln 


in 


Input clock 


ProcSpeedSelectO-2 


in 


Processor speed selectors 


Reset 


in 


Systenn reset 


ErrorOut 


out 


Error indicator 


Analyse 


in 


Error analysis 


BootFromRom 


in 


Boot from external ROM or from link 



Table 2.2 IMS T801 external memory interface 



Pin 


In/Out 


Function 


ProcClockOut 


out 


Processor clock 


MemA2-31 


out 


Thirty address lines 


DataO-31 


in/out 


Thirty-two non-multiplexed data lines 


notl\/lemWrB0-3 


out 


Four byte-addressing write strobes 


notlVlemCE 


out 


Chip enable 


MemWait 


in 


Memory cycle extender 


MemReq 


in 


Direct memory access request 


MemG ranted 


out 


Direct memory access granted 



Table 2.3 IMS T801 event 



Pin 


In/Out 


Function 


EventReq 
EventAck 
EventWalting 


in 

out 
out 


Event request 

Event request acknowledge 

Event input requested by software 



Table 2.4 IMS T801 link 



Pin 


In/Out 


Function 


LlnklnO-3 

LinkOutO-3 

LinkSpeed 


in 

out 
in 


Four serial data input channels 

Four serial data output channels 

Select speed for Links 0-3 to 10 or 20 Mbits/sec 



Signal names are prefixed by not if they are active low, otherwise they are active high. 
Pinout details for various packages are given on page 186. 
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3 Processor 

The 32 bit processor contains instruction processing logic, instruction and work pointers, and an operand 
register. It directly accesses the high speed 4 Kbyte on-chip memory, which can store data or programs. 
Where larger amounts of memory or programs in ROM are required, the processor has access to 4 Gbytes 
of memory via the External Memory Interface (EMI). 



3.1 



Registers 



The design of the transputer processor exploits the availability of fast on-chip memory by having only a small 
number of registers; six registers are used in the execution of a sequential process. The small number of 
registers, together with the simplicity of the instruction set, enables the processor to have relatively simple 
(and fast) data-paths and control logic. The six registers are: 

The workspace pointer which points to an area of store where local variables are kept. 

The instruction pointer which points to the next instruction to be executed. 

The operand register which is used in the formation of instruction operands. 

The /A, B and C registers which form an evaluation stack. 

A, B and C are sources and destinations for most arithmetic and logical operations. Loading a value into the 
stack pushes B into C, and A into B, before loading A. Storing a value from A, pops B into A and C into B. 

Expressions are evaluated on the evaluation stack, and instructions refer to the stack implicitly. For example, 
the ac/Gf instruction adds the top two values In the stack and places the result on the top of the stack. The use of 
a stack removes the need for instructions to respecify the location of their operands. Statistics gathered from a 
large number of programs show that three registers provide an effective balance between code compactness 
and implementation complexity. 

No hardware mechanism Is provided to detect that more than three values have been loaded onto the stack. 
It is easy for the compiler to ensure that this never happens. 

Any location in memory can be accessed relative to the workpointer register, enabling the workspace to be 
of any size. 

Further register details are given in Transputer Instruction Set - A Compiler Writers' Guide. 



Registers 




Locals 




Program 


A 








B 






C 






Workspace 






*" 


Next Inst 
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Figure 3.1 Registers 
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3.2 



Instructions 



The instruction set has been designed for simple and efficient compilation of high-level languages. All in- 
structions have the same format, designed to give a compact representation of the operations occurring most 
frequently in programs. 

Each instruction consists of a single byte divided into two 4-bit parts. The four most significant bits of the byte 
are a function code and the four least significant bitfe are a data value. 





Function Data 


7 4 3 





Operand Register 





Figure 3.2 Instruction format 



3.2.1 



Direct functions 



The representation provides for sixteen functions, each with a data value ranging from to 15. Ten of these, 
shown in table 3.1, are used to encode the most important functions. 

Table 3.1 Direct functions 



load constant 


add constant 




load local 


store local 


load local pointer 


load non-local 


store non-local 




jump 


conditional jump 


call 



The most common operations in a program are the loading of small literal values and the loading and storing 
of one of a small number of variables. The load constant instruction enables values between and 15 to be 
loaded with a single byte instruction. The load local an6 store /oca/ instructions access locations in memory 
relative to the workspace pointer. The first 16 locations can be accessed using a single byte instruction. 

The load non-local and store non-local instructions behave similarly, except that they access locations in 
memory relative to the A register. Compact sequences of these instructions allow efficient access to data 
structures, and provide for simple implementations of the static links or displays used in the implementation 
of high level programming languages such as OCcam, C, Fortran, Pascal or ADA. 

3.2.2 Prefix functions 

Two more function codes allow the operand of any instruction to be extended in length; prefix and negative 
prefix. 

All instructions are executed by loading the four data bits into the least significant four bits of the operand 
register, which is then used as the instruction's operand. All instructions except the prefix instructions end by 
clearing the operand register, ready for the next instruction. 

The prefix instruction loads its four data bits into the operand register and then shifts the operand register up 
four places. The negative prefix instruction is similar, except that it complements the operand register before 
shifting it up. Consequently operands can be extended to any length up to the length of the operand register 
by a sequence of prefix instructions. In particular, operands in the range -256 to 255 can be represented 
using one prefix instruction. 
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The use of prefix instructions has certain beneficial consequences. Firstly, they are decoded and executed 
in the same way as every other instruction, which simplifies and speeds instruction decoding. Secondly, they 
simplify language compilation by providing a completely uniform way of allowing any instruction to take an 
operand of any size. Thirdly, they allow operands to be represented in a form Independent of the processor 
wordlength. 

3.2.3 Indirect functions 

The remaining function code, operate, causes its operand to be interpreted as an operation on the values 
held in the evaluation stack. This allows up to 16 such operations to be encoded in a single byte instruction. 
However, the prefix instructions can be used to extend the operand of an operate Instruction just like any 
other. The instruction representation therefore provides for an indefinite number of operations. 

Encoding of the indirect functions is chosen so that the most frequently occurring operations are represented 
without the use of a prefix instruction. These include arithmetic, logical and comparison operations such as 
add, exclusive or and greater than. Less frequently occurring operations have encodings which require a 
single prefix operation. 

3.2.4 Expression evaluation 

Evaluation of expressions sometimes requires use of temporary variables in the workspace, but the number 
of these can be minimised by careful choice of the evaluation order. 

Table 3.2 Expression evaluation 



Program 


Mnemonic 


x:=0 


Idc 

StI X 


X := #24 


pfix 2 
Idc 4 

StI X 


X := y + z 


Idl y 
Idl z 
add 

StI X 



3.2.5 Efficiency of encoding 

Measurements show that about 70% of executed instructions are encoded in a single byte; that is, without 
the use of prefix instructions. Many of these instructions, such as load constant and add require just one 
processor cycle. 

The instruction representation gives a more compact representation of high level language programs than 
more conventional instruction sets. Since a program requires less store to represent it, less of the memory 
bandwidth is taken up with fetching instructions. Furthermore, as memory is word accessed the processor 
will receive four instructions for every fetch. 

Short instructions also improve the effectiveness of instruction pre-fetch, which in turn improves processor 
performance. There is an extra word of pre-fetch buffer, so the processor rarely has to wait for an instruction 
fetch before proceeding. Since the buffer is short, there is little time penalty when a jump instruction causes 
the buffer contents to be discarded. 
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3.3 



Processes and concurrency 



A process starts, performs a number of actions, and then either stops without completing or terminates 
complete. Typically, a process is a sequence of instructions. A transputer can run several processes in 
parallel (concurrently). Processes may be assigned either high or low priority, and there may be any number 
of each (page 136). 

The processor has a microcoded scheduler which enables any number of concurrent processes to be exe- 
cuted together, sharing the processor time. This removes the need for a software kernel. 



At any time, a concurrent process may be 

Active 

Inactive - 



Being executed. 

On a list waiting to be executed. 

Ready to input. 

- Ready to output. 

- Waiting until a specified time. 

The scheduler operates in such a way that inactive processes do not consume any processor time. It allocates 
a portion of the processor's time to each process in turn. Active processes waiting to be executed are held 
in two linked lists of process workspaces, one of high priority processes and one of low priority processes 
(page 136). Each list is implemented using two registers, one of which points to the first process in the list, 
the other to the last. In the Linked Process List figure 3.3, process S is executing and P, and R are active, 
awaiting execution. Only the low priority process queue registers are shown; the high priority process ones 
perform in a similar manner. 
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Figure 3.3 Linked process list 



Table 3.3 Priority queue control registers 



Function 


High Priority 


Low Priority 


Pointer to front of active process list 
Pointer to back of active process list 


FptrO 
BptrO 


Fptn 
Bptn 



Each process runs until it has completed its action, but is descheduled whilst waiting for communication from 
another process or transputer, or for a time delay to complete. In order for several processes to operate in 
parallel, a low priority process is only permitted to run for a maximum of two time slices before it is forcibly 
descheduled at the next descheduling point (page 140). The time slice period is 5120 cycles of the external 
5 MHz clock, giving ticks approximately 1 ms apart. 
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A process can only be descheduled on certain instructions, known as descheduling points (page 140). As a 
result, an expression evaluation can be guaranteed to execute without the process being timesliced partway 
through. 

Whenever a process is unable to proceed, its instruction pointer is saved in the process workspace and 
the next process taken from the list. Process scheduling pointers are updated by instructions which cause 
scheduling operations, and should not be altered directly. Actual process switch times are less than 1 /iS, as 
little state needs to be saved and it is not necessary to save the evaluation stack on rescheduling. 

The processor provides a number of special operations to support the process model, including start process 
and end process. When a main process executes a parallel construct, start process instructions are used 
to create the necessary additional concurrent processes. A start process instruction creates a new process 
by adding a new workspace to the end of the scheduling list, enabling the new concurrent process to be 
executed together with the ones already being executed. When a process Is made active it is always added 
to the end of the list, and thus cannot pre-empt processes already on the same list. 

The correct termination of a parallel construct is assured by use of the end process instruction. This uses 
a workspace location as a counter of the parallel construct components which have still to terminate. The 
counter is initialised to the number of components before the processes are started. Each component ends 
with an end process instruction which decrements and tests the counter. For all but the last component, the 
counter is non zero and the component is descheduled. For the last component, the counter is zero and the 
main process continues. 

3.4 Priority 

The IMS T801 supports two levels of priority. Priority 1 (low priority) processes are executed whenever there 
are no active priority (high priority) processes. 

High priority processes are expected to execute for a short time. If one or more high priority processes are 
able to proceed, then one is selected and runs until it has to wait for a communication, a timer input, or until 
it completes processing. 

If no process at high priority is able to proceed, but one or more processes at low priority are able to proceed, 
then one is selected. 

Low priority processes are periodically timesliced to provide an even distribution of processor time between 
computationally intensive tasks. 

If there are n low priority processes, then the maximum latency from the time at which a low priority process 
becomes active to the time when it starts processing is 2n-2 timeslice periods. It is then able to execute for 
between one and two timeslice periods, less any time taken by high priority processes. This assumes that 
no process monopolises the transputer's time; i.e. it has a distribution of descheduling points (page 140). 

Each timeslice period lasts for 5120 cycles of the external 5 MHz input clock (approximately 1 ms at the 
standard frequency of 5 MHz). 

If a high priority process is waiting for an external channel to become ready, and if no other high priority 
process is active, then the interrupt latency (from when the channel becomes ready to when the process 
starts executing) is typically 19 processor cycles, a maximum of 78 cycles (assuming use of on-chip RAM). 
If the floating point unit is not being used at the time then the maximum interrupt latency is only 58 cycles. 
To ensure this latency, certain instructions are interruptable. 

3.5 Communications 

Communication between processes is achieved by means of channels. Process communication is point-to- 
point, synchronised and unbuffered. As a result, a channel needs no process queue, no message queue and 
no message buffer. 
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A channel between two processes executing on the same transputer is implemented by a single word in 
memory; a channel between processes executing on different transputers is implemented by point-to-point 
links. The processor provides a number of operations to support message passing, the most important being 
input message and output message. 

The input message and output message instructions use the address of the channel to determine whether 
the channel is internal or external. Thus the same instruction sequence can be used for both, allowing a 
process to be written and compiled without knowledge of where its channels are connected. 

The process which first becomes ready must wait until the second one is also ready. A process performs an 
input or output by loading the evaluation stack with a pointer to a message, the address of a channel, and 
a count of the number of bytes to be transferred, and then executing an input message or output message 
instruction. Data is transferred if the other process is ready. If the channel is not ready or is an external one 
the process will deschedule. 

3.6 Block move 

The block move on the transputer moves any number of bytes from any byte boundary in memory, to any 
other byte boundary, using the smallest possible number of word read, and word or part-word writes. 

A block move instruction can be interrupted by a high priority process. On interrupt, block move is completed 
to a word boundary, independent of start position. When restarting after interrupt, the last word written is 
written again. This appears as an unnecessary read and write in the simplest case of word aligned block 
moves, and may cause problems with FIFOs. This problem can be overcome by incrementing the saved 
destination (BreglntSaveLoc) and source pointer (CreglntSaveLoc) values by BytesPerWord during the high 
priority process. 

3.7 Timers 

The transputer has two 32 bit timer clocks which 'tick' periodically. The timers provide accurate process 
timing, allowing processes to deschedule themselves until a specific time. 

One timer is accessible only to high priority processes and is incremented every microsecond, cycling com- 
pletely in approximately 4295 seconds. The other is accessible only to low priority processes and is incre- 
mented every 64 microseconds, giving exactly 15625 ticks in one second. It has a full period of approximately 
76 hours. 

Table 3.4 Timer registers 



CIocIkO 
C!ocl<1 
TNextRegO 
TNextRegJ 



Current value of high priority (level 0) process clock 

Current value of low priority (level 1) process clock 

Indicates time of earliest event on high priority (level 0) timer queue 

Indicates time of earliest event on low priority (level 1) timer queue 



The current value of the processor clock can be read by executing a load timer instruction. A process can 
arrange to perform a timer input, in which case it will become ready to execute after a specified time has 
been reached. The //mer /npwf instruction requires a time to be specified. If this time is in the 'past' then the 
instruction has no effect. If the time is in the 'future' then the process is descheduled. When the specified 
time is reached the process is scheduled again. 
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Figure 3.4 shows two processes waiting on the timer queue, one waiting for time 21 , the other for time 31 . 
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Figure 3.4 Timer registers 
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4 Instruction set summary 

The Function Codes table 4.8 gives the basic function code set (page 1 33). Where the operand is less than 1 6, 
a single byte encodes the complete instruction. If the operand is greater than 15, one prefix instruction {pfix) 
is required for each additional four bits of the operand. If the operand is negative the first prefix instruction 
will be nfix. 

Table 4.1 pre//x coding 







Function 


Memory 


Mnemonic 


code 


code 


Idc 


#3 


#4 


#43 


Idc 


#35 






is coded as 








pfix 


#3 


#2 


#23 


Idc 


#5 


#4 


#45 


Idc 


#987 






is coded as 








pfix 


#9 


#2 


#29 


pfix 


#8 


#2 


#28 


Idc 


#7 


#4 


#47 


Idc 


-31 {Idc 


#FFFFFFE1) 




is coded as 








nfix 


#1 


#6 


#61 


Idc 


#1 


#4 


#41 



Tables 4.9 to 4.28 give details of the operation codes. Where an operation code is less than 16 (e.g. add: 
operation code 05), the operation can be stored as a single byte comprising the operate function code F and 
the operand (5 in the example). Where an operation code is greater than 15 (e.g. ladd: operation code 16), 
the prefix function code 2 is used to extend the instruction. 



Table 4.2 operate coding 



Mnemonic 



Function 
code 



Memory 
code 



add 
is coded as 

opr 

ladd 
is coded as 

pfix 
opr 



(op. code #5) 

add 

(op. code #1 6) 

#1 
#6 



#F 



#2 
#F 



#F5 

#F5 

#21 F6 

#21 
#F6 



The load device identity (Iddevid) Instruction (table 4.20) pushes the device type identity into the A register. 
Each product is allocated a unique group of numbers for use with the /cfcfe Wof instruction. The product identity 
numbers for the IMS T801 are 20 to 29 inclusive. 



In the Floating Point Operation Codes tables 4.22 to 4.28, a selector sequence code (page 149) is indicated 
In the Memory Code column by s. The code given in the Operation Code column is the indirection code, the 
operand for the Idc instruction. 

The FPU and processor operate concurrently, so the actual throughput of floating point instructions is better 
than that implied by simply adding up the instruction times. For full details see Transputer Instruction Set - A 
Compiler Writers' Guide. 
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The Processor Cycles column refers to the number of periods TPCLPCL taken by an instruction executing 
in internal memory. The number of cycles is given for the basic operation only; where the memory code 
for an instruction is two bytes, the time for the prefix function (one cycle) should be added. For a 20 MHz 
transputer one cycle is 50 ns. Some instruction times vary. Where a letter is included in the cycles column it 
is interpreted from table 4.3. 



Table 4.3 Instruction set interpretation 



Ident 



Interpretation 



b 
m 

n 
w 



Bit number of the highest bit set in register A. Bit is the least significant bit. 

Bit number of the highest bit set in the absolute value of register A. 
Bit is the least significant bit. 

Number of places shifted. 

Number of words in the message. Part words are counted as full words. If the message 
is not word aligned the number of words is increased to include the part words at either 
end of the message. 

Number of words per row. 

Number of rows. 



The DE column of the tables indicates the descheduling/error features of an instruction as described in 
table 4.4. 

Table 4.4 Instruction features 



Ident 


Feature 


See page: 


D 
E 

F 


The instruction is a descheduling point 
The instruction will affect the Error flag 
The instruction will affect the FP.Error flag 


140 
141, 156 
149, 141 



4.1 



Descheduling points 



The instructions in table 4.5 are the only ones at which a process may be descheduled (page 135). They are 
also the ones at which the processor will halt if the Analyse pin is asserted (page 155). 



Table 4.5 Descheduling point instructions 



input message 
timer alt wait 
jump 



output message 
timer input 
loop end 



output byte 
stop on error 
end process 



output word 
alt wait 
stop process 
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4.2 



Error instructions 



The instructions in table 4.6 are tine only ones which can affect the Error flag (page 156) directly. Note, 
however, that the floating point unit error flag FP.Error'is set by certain floating point instructions (page 141), 
and that Error can be set from this flag by fpcheckerror. 

Table 4.6 Error setting instructions 



add 


add constant 


subtract 




multiply 


fractional multiply 


divide 


remainder 


long add 


long subtract 


long divide 




set error 


testerr 


fpcheckerror 




check word 


check subscript from 


check single 


check count from 1 



4.3 



Debugging support 



Table 4.21 contains a number of Instructions to facilitate the implementation of breakpoints. These instructions 
overload the operation of yO. Normally yO is a no-op which might cause descheduling. Se//(}drea/c enables the 
breakpointing facilities and causes jO to act as a breakpointing instruction. When breakpointing is enabled, 
jO swaps the current Iptr and Wptr with an Iptr and Wptr stored above MemStart. The breakpoint instruction 
does not cause descheduling, and preserves the state of the registers. It is possible to single step the pro- 
cessor at machine level using these instructions. Refer to Support for debugging/breakpointing in transputers 
(technical note 61) for more detailed information regarding debugger support. 



4.4 



Floating point errors 



The instructions in table 4.7 are the only ones which can affect the floating point error flag FP.Error (page 149). 
Error is set from this flag by fpcheckerror if FP.Error is set. 



Table 4.7 Floating point error setting instructions 



fpadd 


fpsub 


fpmul 


fpdiv 


fpldnladdsn 


fpldnladddb 


fpldnlmulsn 


fpldnlmuldb 


fpremflrst 


fpusqrtfirst 


fpgt 


fpeq 


fpuseterror 


fpuclearerror 


fptesterror 




fpuexpincby32 


fpuexpdecby32 


fpumulby2 


fpudivby2 


fpur32tor64 


fpur64tor32 


fpucki32 


fpuckiSA 


fprtoi32 


fpuabs 


fpint 
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Table 4.8 IMS T801 function codes 



Function 


IVlemory 




Processor 




D 


Code 


Code 


IVInemonic 


Cycles 


Name 


E 





OX 


J 


3 


jump 


D 


1 


1X 


Idip 


1 


load local pointer 




2 


2X 


pfix 


1 


prefix 




3 


3X 


Idnl 


2 


load non-local 




4 


4X 


Idc 


1 


load constant 




5 


5X 


Idnip 


1 


load non-local pointer 




6 


6X 


nfix 


1 


negative prefix 




7 


7X 


Idl 


2 


load local 




8 


8X 


adc 


1 


add constant 


E 


9 


9X 


call 


7 


call 




A 


AX 


cj 


2 

4 


conditional jump (not taken) 
conditional jump (taken) 




B 


BX 


ajw 


1 


adjust workspace 




C 


OX 


eqc 


2 


equals constant 




D 


DX 


sti 


1 


store local 




E 


EX 


stnl 


2 


store non-local 




F 


FX 


opr 


- 


operate 





Table 4.9 IMS T801 arithmetic/logical operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


46 


24F6 


and 


1 


and 




4B 


24FB 


or 


1 


or 




33 


23F3 


xor 


1 


exclusive or 




32 


23F2 


not 


1 


bitwise not 




41 


24F1 


shI 


n+2 


shift left 




40 


24F0 


shr 


n+2 


shift right 




05 


F5 


add 


1 


add 


E 


OC 


FC 


sub 


1 


subtract 


E 


53 


25F3 


mul 


38 


multiply 


E 


72 


27F2 


fmul 


35 


fractional multiply (no rounding) 


E 








40 


fractional multiply (rounding) 


E 


20 


22FC 


div 


39 


divide 


E 


1F 


21 FF 


rem 


37 


remainder 


E 


09 


F9 


gt 


2 


greater than 




04 


F4 


diff 


1 


difference 




52 


25F2 


sum 


1 


sum 




08 


F8 


prod 


b+4 
m+5 


product for positive register A 
product for negative register A 
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Table 4.10 IMS T801 long arithmetic operation codes 



Operation 


IVlemory 




Processor 




D 


Code 


Code 


IVInemonic 


Cycles 


Name 


E 


16 


21 F6 


ladd 


2 


long add 


E 


38 


23F8 


Isub 


2 


long subtract 


E 


37 


23F7 


Isum 


3 


long sum 




4F 


24FF 


Idiff 


3 


long diff 




31 


23F1 


Imul 


33 


long multiply 




1A 


21 FA 


Idiv 


35 


long divide 


E 


36 


23F6 


IshI 


n+3 
n-28 


long shift left (n<32) 
long shift left(n>32) 




35 


23F5 


Ishr 


n+3 
n-28 


long shift right (n<32) 
long shift right (n>32) 




19 


21 F9 


norm 


n+5 
n-26 

3 


normalise (n<32) 
normalise (n>32) 
normalise (n=64) 





Table 4.11 IMS T801 general operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


00 


FO 


rev 


1 


reverse 




3A 


23FA 


xword 


4 


extend to word 




56 


25F6 


cword 


5 


check word 


E 


ID 


21 FD 


xdble 


2 


extend to double 




40 


24FC 


csngi 


3 


check single 


E 


42 


24F2 


mint 


1 


minimum integer 




5A 


25FA 


dup 


1 


duplicate top of stack 




79 


27F9 


pop 


1 


pop processor stack 





Table 4.12 IMS T801 2D block move operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


5B 


25FB 


move2dinit 


8 


initialise data for 2D block move 




50 


25FO 


move2dall 


(2p+23)*r 


2D block copy 




5D 


25FD 


move2dnonzero 


(2p+23)*r 


2D block copy non-zero bytes 




5E 


25FE 


move2dzero 


(2p+23)*r 


2D block copy zero bytes 





Table 4.13 IMS T801 CRC and bit operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


74 
75 

76 
11 
78 


27F4 
27F5 

27F6 
27F7 
27F8 


crcword 
crcbyte 

bitcnt 

bitrevword 

bitrevnbits 


35 
11 

b+2 
36 
n+4 


calculate crc on word 
calculate crc on byte 

count bits set in word 
reverse bits in word 
reverse bottom n bits in word 
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Table 4.14 IMS T801 indexing/array operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 
E 


02 
OA 
81 
34 
3F 


F2 

FA 

28F1 

23F4 

23FF 


bsub 

wsub 

wsubdb 

bent 

went 


1 
2 
3 
2 
5 


byte subscript 

word subscript 

form double word subscript 

byte count 

word count 




01 
3B 


F1 
23FB 


lb 
sb 


5 
4 


load byte 
store byte 




4A 


24FA 


move 


2w+8 


move message 





Table 4.15 IMS T801 timer handling operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


22 


22F2 


Idtimer 


2 


load timer 




2B 
4E 


22FB 
24FE 


tin 
talt 


30 
4 
4 


timer input (time future) 
timer input (time past) 
timer alt start 


D 
D 


51 
47 


25F1 
24F7 


taltwt 
enbt 


15 
48 
8 


timer alt wait (time past) 
timer alt wait (time future) 
enable timer 


D 
D 


2E 


22FE 


dist 


23 


disable timer 





Table 4.16 IMS T801 input/output operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 
E 


07 
OB 
OF 
OE 


F7 
FB 
FF 
FE 


in 

out 

outword 
outbyte 


2W+19 
2W+19 

23 

23 


input message 
output message 
output word 
output byte 


D 
D 
D 
D 


43 


24F3 


alt 


2 


alt start 




44 
45 


24F4 
24F5 


altwt 
altend 


5 
17 
4 


alt wait (channel ready) 
alt wait (channel not ready) 
alt end 


D 
D 


49 
30 


24F9 
23F0 


enbs 
diss 


3 
4 


enable skip 
disable skip 




12 


21 F2 


resetch 


3 


reset channel 




48 
2F 


24F8 
22FF 


enbe 
disc 


7 
5 
8 


enable channel (ready) 
enable channel (not ready) 
disable channel 
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Table 4.17 IMS T801 control operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


20 
IB 
30 
06 
21 


22F0 
21 FB 
23FO 

F6 
22F1 


ret 

Idpi 

gajw 

gcall 

lend 


5 
2 
2 

4 
10 
5 


return 

load pointer to instruction 

general adjust workspace 

general call 

loop end (loop) 

loop end (exit) 


D 
D 



Table 4.18 IMS T801 scheduling operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


OD 


FD 


startp 


12 


start process 


D 


03 


F3 


endp 


13 


end process 


D 


39 


23F9 


runp 


10 


run process 




15 


21 F5 


stopp 


11 


stop process 




IE 


21 FE 


Idpri 


1 


load current priority 





Table 4.19 IMS T801 error handling operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


13 
4D 


21 F3 
24FD 


csubO 
ccnti 


2 
3 


check subscript from 
check count from 1 


E 
E 


29 

10 


22F9 
21 FO 


testerr 
seterr 


2 
3 

1 


test error false and clear (no error) 
test error false and clear (error) 
set error 


E 


55 
57 


25F5 
25F7 


stoperr 
cirhalterr 


2 

1 


stop on error (no error) 
clear halt-on-error 


D 


58 


25F8 


sethalterr 


1 


set halt-on-error 




59 


25F9 


testhalterr 


2 


test halt-on-error 





Table 4.20 IMS T801 processor initialisation operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


2A 


22FA 


testpranal 


2 


test processor analysing 




3E 


23FE 


saveh 


4 


save high priority queue registers 




3D 


23FD 


savel 


4 


save low priority queue registers 




18 


21 F8 


sthf 




store high priority front pointer 




50 


25F0 


sthb 




store high priority back pointer 




1C 


21 FC 


stif 




store low priority front pointer 




17 


21 F7 


stib 




store low priority back pointer 




54 


25F4 


sttimer 




store timer 




170 


2127FC 


Iddevid 




load device identity 




7E 


27FE 


Idmemstartval 




load value of memstart address 
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Table 4.21 IMS T801 debugger support codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 





00 


jump 


3 

11 
13 


jump (break not enabled) 

jump (break enabled, high priority) 

jump (break enabled, low priority) 


D 


B1 


2BF1 


break 


9 

11 


break (high priority) 
break (low priority) 




B2 


28F2 


cIrjObreak 


1 


clear jump break enable flag 




83 


28F3 


setjObreak 


1 


set jump break enable flag 




84 


2BF4 


testjObreak 


2 


test jump break enable flag set 




7A 


27FA 


timerdisableh 


1 


disable high priority timer interrupt 




78 


27F8 


timerdisablel 


1 


disable low priority timer interrupt 




70 


27FC 


timerenableh 


6 


enable high priority timer interrupt 




7D 


27FD 


timerenablel 


6 


enable low priority timer interrupt 





Table 4.22 IMS T801 floating point load/store operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


8E 


28FE 


fpldnlsn 


2 


fp load non-local single 




8A 


28FA 


fpldnldb 


3 


fp load non-local double 




86 


28F6 


fpldnlsnl 


4 


fp load non-local indexed single 




82 


28F2 


fpldnldbi 


6 


fp load non-local indexed double 




9F 


29FF 


fpldzerosn 


2 


load zero single 




AO 


2AF0 


fpldzerodb 


2 


load zero double 




AA 


2AFA 


fpldnladdsn 


8/11 


fp load non local & add single 


F 


A6 


2AF6 


fpldnladddb 


9/12 


fp load non local & add double 


F 


AG 


2AFG 


fpldnlmulsn 


13/20 


fp load non local & multiply single 


F 


A8 


2AF8 


fpldnlmuldb 


21/30 


fp load non local & multiply double 


F 


88 


28F8 


fpstnisn 


2 


fp store non-local single 




84 


28F4 


fpstnldb 


3 


fp store non-local double 




9E 


29FE 


fpstnli32 


4 


store non-local int32 





Processor cycles are shown as Typical/Maximum cycles. 



Table 4.23 IMS T801 floating point general operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 
E 


AB 
A4 
A3 


2AFB 
2AF4 
2AF3 


fpentry 

fprev 

fpdup 


1 
1 
1 


floating point unit entry 
fp reverse 
fp duplicate 
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Table 4.24 IMS T801 floating point rounding operation codes 



Operation 


IVIemory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


22 


s 


fpurn 


1 


set rounding mode to round nearest 




06 


s 


fpurz 


1 


set rounding mode to round zero 




04 


s 


fpurp 


1 


set rounding mode to round positive 




05 


s 


fpurm 


1 


set rounding mode to round minus 





Table 4.25 IMS T801 floating point error operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 
E 


83 
90 
23 
90 


28F3 
29F0 

s 

s 


fpchkerror 
fptesterror 
fpuseterror 
fpuclearerror 


1 

2 
1 
1 


check fp error 

test fp error false and clear 

set fp error 

clear fp error 


E 
F 
F 
F 



Table 4.26 IMS T801 floating point comparison operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


94 


29F4 


fpgt 


4/6 


fp greater than 


F 


95 


29F5 


fpeq 


3/5 


fp equality 


F 


92 


29F2 


fpordered 


3/4 


fp orderability 




91 


29F1 


fpnan 


2/3 


fpNaN 




93 


29F3 


fpnotfinite 


2/2 


fp not finite 




OE 


s 


fpuchkl32 


3/4 


check in range of type int32 


F 


OF 


s 


fpuchkl64 


3/4 


check in range of type int64 


F 



Processor cycles are shown as Typical/Maximum cycles. 



Table 4.27 IMS T801 floating point conversion operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


07 


s 


fpur32tor64 


3/4 


real32 to real64 


F 


08 


s 


fpur64tor32 


6/9 


real64 to real32 


F 


9D 


29FD 


fprtoi32 


7/9 


real to int32 


F 


96 


29F6 


fpi32tor32 


8/10 


int32 to real32 




98 


29F8 


fpi32tor64 


8/10 


int32 to rea!64 




9A 


29FA 


fpb32tor64 


8/8 


bJt32 to reaI64 




OD 


s 


fpunoround 


2/2 


real64 to real32, no round 




A1 


2AF1 


fpint 


5/6 


round to floating integer 


F 



Processor cycles are shown as Typical/Maximum cycles. 
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Table 4.28 IMS T801 floating point arithmetic operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor cycles 


Name 


D 

E 


Single 


Double 


87 


28F7 


fpadd 


6/9 


6/9 


fpadd 


F 


89 


28F9 


fpsub 


6/9 


6/9 


fp subtract 


F 


8B 


28FB 


fpmul 


11/18 


18/27 


fp multiply 


F 


80 


28FC 


fpdiv 


16/28 


31/43 


fp divide 


F 


OB 


s 


fpuabs 


2/2 


2/2 


fp absolute 


F 


8F 


28FF 


fpremfirst 


36/46 


36/46 


fp remainder first step 


F 


90 


29F0 


fpremstep 


32/36 


32/36 


fp remainder iteration 




01 


s 


fpusqrtflrst 


27/29 


27/29 


fp square root first step 


F 


02 


s 


fpusqrtstep 


42/42 


42/42 


fp square root step 




03 


s 


fpusqrtlast 


8/9 


8/9 


fp square root end 




OA 


s 


fpuexplnc32 


6/9 


6/9 


multiply by 2^^ 


F 


09 


s 


fpuexpdec32 


6/9 


6/9 


divide by 2^^ 


F 


12 


s 


fpumulby2 


6/9 


6/9 


multiply by 2.0 


F 


11 


s 


fpudivby2 


6/9 


6/9 


divide by 2.0 


F 



Processor cycles are shown as Typical/Maximum cycles. 
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5 Floating point unit 

The 64 bit FPU provides single and double length arithmetic to floating point standard ANSI-IEEE 754-1985. 
It is able to perform floating point arithmetic concurrently with the central processor unit (CPU), sustaining 
3.3 Mflops on a 30 MHz device. All data communication between memory and the FPU occurs under control 
of the CPU. 

The FPU consists of a microcoded computing engine with a three deep floating point evaluation stack for 
manipulation of floating point numbers. These stack registers are FA, FB and FC, each of which can hold 
either 32 bit or 64 bit data; an associated flag, set when a floating point value is loaded, indicates which. The 
stack behaves in a similar manner to the CPU stack (page 132). 

As with the CPU stack, the FPU stack is not saved when rescheduling (page 135) occurs. The FPU can 
be used in both low and high priority processes. When a high priority process interrupts a low priority one 
the FPU state is saved inside the FPU. The CPU will service the interrupt immediately on completing its 
current operation. The high priority process will not start, however, before the FPU has completed its current 
operation. 

Points in an instruction stream where data need to be transferred to or from the FPU are called synchronisation 
points. At a synchronisation point the first processing unit to become ready will wait until the other is ready. 
The data transfer will then occur and both processors will proceed concurrently again. In order to make 
full use of concurrency, floating point data source and destination addresses can be calculated by the CPU 
whilst the FPU is performing operations on a previous set of data. Device performance is thus optimised by 
minimising the CPU and FPU idle times. 

The FPU has been designed to operate on both single length (32 bit) and double length (64 bit) floating 
point numbers, and returns results which fully conform to the ANSI-IEEE 754-1985 floating point arithmetic 
standard. Denormalised numbers are fully supported in the hardware. All rounding modes defined by the 
standard are implemented, with the default being round to nearest. 

The basic addition, subtraction, multiplication and division operations are performed by single instructions. 
However, certain less frequently used floating point instructions are selected by a value in register A (when 
allocating registers, this should be taken into account). A load constant instruction Idc is used to load 
register A; the floating point entry instruction fpenfry then uses this value to select the floating point operation. 
This pair of instructions is termed a selector sequence. 

Names of operations which use fpentry begin with fpu. A typical usage, returning the absolute value of a 
floating point number, would be 

Idc fpuabs; fpentry; 

Since the indirection code for fpuabs is OB, it would be encoded as 



Table 5.1 fpentry co6\nQ 



Mnemonic 



Function 
code 



Memory 
code 



Idc 

fpentry 
is coded as 

pfix 
opr 



fpuabs 

(op. code #AB) 

#A 
#B 



#4 



#2 
#F 



#4B 

#2AFB 

#2A 
#FB 
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The remainder an6 square root instructions take considerably longer than other instructions to complete. In 
order to minimise the interrupt latency period of the transputer they are split up to form instruction sequences. 
As an example, the instruction sequence for a single length square root is 

fpusqrtfirst; fpusqrtstep; fpusqrtstep; fpusqrtlast; 

The FPU has its own error flag FP.Error This reflects the state of evaluation within the FPU and is set in 
circumstances where invalid operations, division by zero or overflow exceptions to the ANSI-IEEE 754-1985 
standard would be flagged (page 141). FP.Error'is also set if an input to a floating point operation is infinite or 
is not a number (NaN). The FP.Error flag can be set, tested and cleared without affecting the main Error flag, 
but can also set Error when required (page 141). Depending on how a program is compiled, it Is possible for 
both unchecked and fully checked floating point arithmetic to be performed. 

Further details on the operation of the FPU can be found in Transputer Instruction Set - A Compiler Writers' 
Guide. 

Table 5.2 Typical floating point operation times for IMS T801 



Operation 


T801-20 


T801.30 1 


Single length 


Double length 


Single length 


Double length 


add 

subtract 
multiply 
divide 


350 ns 
350 ns 
550 ns 
850 ns 


350 ns 

350 ns 

1000 ns 

1600 ns 


233 ns 
233 ns 
367 ns 
567 ns 


233 ns 
233 ns 
667 ns 

1067 ns 



Timing is for operations where both operands are normalised fp numbers. 
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6 System services 

System services include all the necessary logic to initialise and sustain operation of the device. They also 
include error handling and analysis facilities. 



6.1 



Power 



Power is supplied to the device via the VCC and GND pins. The supply must be decoupled close to the chip 
by at least one 100 nF low inductance (e.g. ceramic) capacitor between VCC and GND. Four layer boards 
are recommended; if two layer boards are used, extra care should be taken in decoupling. 

Input voltages must not exceed specification with respect to VCC and GND, even during power-up and power- 
down ramping, otherwise latchup can occur. CMOS devices can be permanently damaged by excessive 
periods of latchup. 



6.2 



CapPlus, CapMinus 



The internally derived power supply for internal clocks requires an external low leakage, low inductance 1/iF 
capacitor to be connected between CapPlus and CapMinus. A ceramic capacitor is preferred, with an 
impedance less than 3 Ohms between 100 KHz and 10 MHz. If a polarised capacitor is used the negative 
terminal should be connected to CapMinus. Total RGB track length should be less than 50 mm. The 
connections must not touch power supplies or other noise sources. 
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Phase-locked 
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CapMinus 
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Figure 6.1 Recommended PLL decoupling 



6.3 



Clockln 



Transputer family components use a standard clock frequency, supplied by the user on the Clockln input. 
The nominal frequency of this clock for all transputer family components is 5 MHz, regardless of device type, 
transputer word length or processor cycle time. High frequency internal clocks are derived from Clockln, 
simplifying system design and avoiding problems of distributing high speed clocks externally. 

A number of transputer devices may be connected to a common clock, or may have individual clocks providing 
each one meets the specified stability criteria. In a multi-clock system the relative phasing of Clockln clocks 
is not important, due to the asynchronous nature of the links. Mark/space ratio is unimportant provided the 
specified limits of Clockln pulse widths are met. 

Oscillator stability Is important. Clockln must be derived from a crystal oscillator; RC oscillators are not 
sufficiently stable. Clockln must not be distributed through a long chain of buffers. Clock edges must be 
monotonic and remain within the specified voltage and time limits. 
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Table 6.1 Input clock 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TDCLDCH 


Clockln pulse width low 


40 






ns 


1 


TDCHDCL 


Clockln pulse width high 


40 






ns 


1 


TDCLDCL 


Clockln period 




200 




ns 


1,2,4 


TDCerror 


Clockln timing error 






±0.5 


ns 


1,3 


TDC1 DC2 


Difference in Clockln for 2 linked devices 






400 


ppm 


1,4 


TDCr 


Clockln rise time 






10 


ns 


1,5 


TDCf 


Clockln fall time 






8 


ns 


1,5 



Notes 



1 These parameters are not tested. 

2 Measured between corresponding points on consecutive falling edges. 

3 Variation of individual falling edges from their nominal times. 

4 This value allows the use of 200 ppm crystal oscillators for two devices connected together by a link. 

5 Clock transitions must be monotonic within the range VIH to VIL (table 1 1.3). 
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Figure 6.2 Clockln timing 



6.4 



ProcSpeedSelectO-2 



Processor speed of the IMS T801 is variable in discrete steps. The desired speed can be selected, up to the 
maximum rated for a particular component, by the three speed select lines ProcSpeedSelectO-2. The pins 
are tied high or low, according to table 6.2, for the various speeds. The frequency of Clockln for the speeds 
given in table 6.2 is 5 MHz. There are six valid speed select combinations. 
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Table 6.2 Processor speed selection 



Proc 


Proc 


Proc 


Processor 


Processor 




Speed 
Select2 


Speed 
Selecti 


Speed 
SelectO 


Clock 
Speed MHz 


Cycle 
Time ns 


Notes 











20.0 


50.0 










1 


22.5 


44.4 







1 





25.0 


40.0 







1 


1 


30.0 


33.3 




1 








35.0 


28.6 




1 





1 






Invalid 


1 


1 





17.5 


57.1 




1 


1 


1 






Invalid 



Note: Inclusion of a speed selection in this table does not imply immediate availability. 



6.5 



Reset 



Reset can go high with VCC, but must at no time exceed the maximum specified voltage for VIH. After VCC 
is valid Clockln should be running for a minimum period TDCVRL before the end of Reset. The falling edge 
of Reset initialises the transputer and starts the bootstrap routine. Link outputs are forced low during reset; 
link Inputs and EventReq should be held low. Memory request (DMA) must not occur whilst Reset is high 
but can occur before bootstrap (page 167). 

If BootFromRom is high, bootstrapping will take place immediately after Reset goes low, using data from 
external memory; othenwise the transputer will await an input from any link. The processor will be in the low 
priority state. 



6.6 



Bootstrap 



The transputer can be bootstrapped either from a link or from external ROM. To facilitate debugging, Boot- 
FromRom may be dynamically changed but must obey the specified timing restrictions. It Is sampled once 
only by the transputer, before the first instruction is executed after Reset is taken low. 

If BootFromRom is connected high (e.g. to VCC) the transputer starts to execute code from the top two bytes 
in external memory, at address #7FFFFFFE. This location should contain a backward jump to a program in 
ROM. Following this access, BootFromRom may be taken low if required. The processor is in the low priority 
state, and the 1/1/ register points to MemStart (page 157). 
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Table 6.3 Reset and Analyse 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPVRH 


Power valid before Reset 


10 






ms 




TRHRL 


Reset pulse width high 


8 






Clockln 


1 


TDCVRL 


Clockln running before Reset end 


10 






ms 


2 


TAHRH 


Analyse setup before Reset 


3 






ms 




TRIAL 


Analyse hold after Reset end 


1 






Clockln 


1 


TBRVRL 


BootFromRom setup 









ms 




TRLBRX 


BootFromRom hold after Reset 









ms 


3 


TALBRX 


BootFromRom hold after Analyse 










3 



Notes 



1 Full periods of Clockln TDCLDCL required. 

2 At power-on reset. 

3 Must be stable until after end of bootstrap period. See Bootstrap section. 
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Figure 6.3 Transputer reset timing with Analyse low 
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Figure 6.4 Transputer reset and analyse timing 
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If BootFromRom is connected low (e.g. to GND) the transputer will wait for the first bootstrap message to 
arrive on any one of its links. The transputer is ready to receive the first byte on a link within two processor 
cycles TPCLPCL after Reset goes low. 

If the first byte received (the control byte) is greater than 1 it is taken as the quantity of bytes to be input. The 
following bytes, to that quantity, are then placed in internal memory starting at location MemStart. Following 
reception of the last byte the transputer will start executing code at MemStart as a low priority process. 
BootFromRom may be taken high after reception of the last byte, if required. The memory space immediately 
above the loaded code is used as work space. Messages arriving on other links after the control byte has 
been received and on the bootstrapping link after the last bootstrap byte will be retained until a process inputs 
from them. 

6.7 Peek and poke 

Any location in internal or external memory can be interrogated and altered when the transputer is waiting 
for a bootstrap from link. If the control byte is then eight more bytes are expected on the same link. The 
first four byte word Is taken as an internal or external memory address at which to poke (write) the second 
four byte word. If the control byte is 1 the next four bytes are used as the address from which to peek (read) 
a word of data; the word is sent down the output channel of the same link. 

Following such a peek or poke, the transputer returns to its previously held state. Any number of accesses 
may be made in this way until the control byte is greater than 1, when the transputer will commence reading 
its bootstrap program. Any link can be used, but addresses and data must be transmitted via the same link 
as the control byte. 

6.8 Analyse 

If Analyse is taken high when the transputer is running, the transputer will halt at the next descheduling point 
(page 140). From Analyse being asserted, the processor will halt within three time slice periods plus the 
time taken for any high priority process to complete. As much of the transputer status Is maintained as is 
necessary to permit analysis of the halted machine. Processor flags Error and HaltOn Error are not altered at 
reset, whether Analyse is asserted or not. Memory refresh continues. 

Input links will continue with outstanding transfers. Output links will not make another access to memory 
for data but will transmit only those bytes already in the link buffer. Providing there is no delay in link 
acknowledgement, the links should be inactive within a few microseconds of the transputer halting. 

Reset should not be asserted before the transputer has halted and link transfers have ceased. When Reset 
is taken low whilst Analyse is high, neither the memory configuration sequence nor the block of eight refresh 
cycles will occur; the previous memory configuration will be used for any external memory accesses. If 
BootFromRom is high the transputer will bootstrap as soon as Analyse is taken low, othen^/ise it will await a 
control byte on any link. If Analyse is taken low without Reset going high the transputer state and operation 
are undefined. After the end of a valid Analyse sequence the registers have the values given In table 6.4. 

Table 6.4 Register values after Analyse 



/ MemStart if bootstrapping from a link, or the external memory bootstrap address if 
bootstrapping from ROM. 

W MemStart if bootstrapping from ROM, or the address of the first free word after the 
bootstrap program if bootstrapping from link. 

A The value of / when the processor halted. 

B The value of Wv^hen the processor halted, together with the priority of the process 
when the transputer was halted (i.e. the IV descriptor). 

C The ID of the bootstrapping link if bootstrapping from link. 
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6.9 



ErrorOut 



The ErrorOut pin is connected directly to the internal Error flag and follows the state of that flag. If ErrorOut 
is high it indicates an error in one of the processes caused, for example, by arithmetic overflow, divide by 
zero, array bounds violation or software setting the flag directly (page 1 41 ). It can also be set from the floating 
point unit under certain circumstances (page 141 , 149). Once set, the Error flag Is only cleared by executing 
the instruction testerr. The error is not cleared by processor reset, in order that analysis can identify any 
errant transputer (page 155). 

A process can be programmed to stop if the Error flag is set; it cannot then transmit erroneous data to other 
processes, but processes which do not require that data can still be scheduled. Eventually all processes 
which rely, directly or Indirectly, on data from the process in error will stop through lack of data. 

By setting the Ha//0/7Error flag the transputer itself can be programmed to halt if Error becomes set. If Error 
becomes set after HaltOnErrorhas been set, all processes on that transputer will cease but will not necessarily 
cause other transputers in a network to halt. Setting HaltOn Error aHer Error will not cause the transputer to 
halt; this allows the processor reset and analyse facilities to function with the flags in indeterminate states. 

An alternative method of error handling is to have the errant process or transputer cause all transputers to 
halt. This can be done by applying the ErrorOut output signal of the errant transputer to the EventReq pin 
of a suitably programmed master transputer. Since the process state is preserved when stopped by an error, 
the master transputer can then use the analyse function to debug the fault. When using such a circuit, note 
that the Error flag is in an indeterminate state on power up; the circuit and software should be designed with 
this in mind. 

Error checks can be removed completely to optimise the performance of a proven program; any unexpected 
error then occurring will have an arbitrary undefined effect. 

If a high priority process pre-empts a low priority one, status of the Error and HaltOnError flags is saved 
for the duration of the high priority process and restored at the conclusion of it. Status of the Error flag is 
transmitted to the high priority process but the HaltOnError flag is cleared before the process starts. Either 
flag can be altered in the process without upsetting the error status of any complex operation being carried 
out by the pre-empted low priority process. 

In the event of a transputer halting because of HaltOnError, the links will finish outstanding transfers before 
shutting down. If Analyse is asserted then all inputs continue but outputs will not make another access to 
memory for data. Memory refresh will continue to take place. 

After halting due to the Error flag changing from to 1 whilst HaltOnError Is set, register /points two bytes 
past the instruction which set Error. After halting due to the Analyse pin being taken high, register / points 
one byte past the instruction being executed. In both cases /will be copied to register A. 
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Figure 6.5 Error handling in a multi-transputer system 
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7 Memory 

The IMS T801 can access 4 Gbytes of external memory space. The IMS T801 also has 4 Kbytes of fast 
internal static memory for high rates of data throughput. Each internal memory access takes one processor 
cycle ProcClockOut (page 162). Internal and external memory are part of the same linear address space. 

IMS T801 memory is byte addressed, with words aligned on four-byte boundaries. The least significant byte 
of a word is the lowest addressed byte. 

The bits in a byte are numbered to 7, with bit the least significant. The bytes are numbered from 0, with 
byte the least significant. In general, wherever a value is treated as a number of component values, the 
components are numbered in order of increasing numerical significance, with the least significant component 
numbered 0. Where values are stored in memory, the least significant component value is stored at the 
lowest (most negative) address. 

Internal memory starts at the most negative address #80000000 and extends to #80000FFF. User memory 
begins at #80000070; this location is given the name MemStart. 

The reserved area of internal memory below MemStart is used to implement link and event channels. 

Two words of memory are reserved for timer use, TPtrLocO for high priority processes and TPtrLod for low 
priority processes. They either indicate the relevant priority timer is not in use or point to the first process on 
the timer queue at that priority level. 

Values of certain processor registers for the current low priority process are saved In the reserved IntSaveLoc 
locations when a high priority process pre-empts a low priority one. Other locations are reserved for extended 
features such as block moves and floating point operations. 

External memory space starts at #80001000 and extends up through #00000000 to #7FFFFFFF. ROM boot- 
strapping code must be In the most positive address space, starting at #7FFFFFFE. Address space immedi- 
ately below this is conventionally used for ROM based code. 
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Figure 7.1 IMS T801 memory map 



1 These locations are used as auxiliary processor registers and should not be manipulated by the user. Like 
processor registers, their contents may be useful for implementing debugging tools (Analyse, page 155). For 
details see Transputer Instruction Set - A Compiler Writers' Guide. 
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8 External memory interface 

The IMS T801 External Memory Interface (EMI) allows access to a 32 bit address space via separate address 
and data buses. 

The external mennory cycle is divided into four Tstates with the following functions: 

11 Address and control setup time. 

T2 Data setup time. 

T3 Data read/write. 

T4 Data and address hold after access. 

Each Tstate is half a processor cycle TPCLPCL long. An external memory cycle is always a complete 
number of cycles TPCLPCL in length. The start of T1 always coincides with a rising edge of ProcClockOut. 
T2 can be extended indefinitely by adding externally generated wait states of one complete processor cycle 
each. 

During an internal memory access cycle the external memory interface address bus MemA2-31 reflects the 
word address used to access internal RAM, notMemWrBO-3 and notMemCE are inactive and the data bus 
MemDO-31 is tristated. This is true unless and until a DMA (memory request) activity takes place, when the 
MemA2-31, MemDO-31, notMemCE and notMemWrBO-3 signals will be placed In a high Impedance state 
by the transputer. 

Bus activity Is not adequate to trace the internal operation of the transputer in full, but may be used for 
hardware debugging in conjuction with peek and poke (page 155). 
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Figure 8.1 IMS T801 bus activity for 3 internal memory cycles 
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8.1 
8.1.1 



Pin functions 
MemA2-31 



External memory addresses are output on a non-multiplexed 30 bit bus. The address is valid at the start of 
T1 and remains so until the end of T4. 

8.1.2 MemDO-31 

The non-multiplexed data bus is 32 bits wide. The data bus is high impedance except when the transputer is 
writing data. If only one byte is being written, the unused 24 bits of the bus are high impedance at that time. 

If the data setup time for read or write is too short it can be extended by inserting wait states at the end of 
T2. 

8.1.3 notlVlemCE 

The active low signal notl\/lemCE is used to enable external memory on both read and write cycles. 

Table 8.1 notMemCE to ProcClocl<Out skew 



SYIVIBOL 


PARAIVIETER 


7801-30 


T801-25 


T801-20 


NOTE 


IVilN 


IVIAX 


IVIIN 


IVIAX 


IVIIN 


MAX 


TPCHEL 
TPCLEH 


notMemCE falling 

from ProcClockOut rising 

ProcClockOut falling 
to notMemCE rising 


6 
6 


10 
10 


8 
8 


12 
12 


10 
10 


14 
14 


1 
1 



Notes 



1 Units are ns. 
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Figure 8.2 IMS T801 skew of notMemCE to ProcClockOut 
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Figure 8.3 IMS T801 static RAM application 



8.1.4 



notMemWrBO-3 



Four write enables notMemWrBO-3 are provided, one to write each byte of a word. When writing a word, the 
four appropriate write enables are asserted; when writing a byte only the appropriate write enable is asserted. 



8.1.5 



MemWait 



Wait states can be selected by taking MemWait high. Externally generated wait states of one complete 
processor cycle can be added to extend the duration of T2 indefinitely. 



8.1.6 MemReq, MemGranted 

Direct memory access (DMA) can be requested at any time by driving the asynchronous MemReq input high. 

MemGranted follows the timing of the bus being tristated and can be used to signal to the device requesting 
the DMA that it has control of the bus. Note that MemGranted changes on the falling edge of ProcClockOut 
and can therefore be sampled to establish control of the bus on the rising edge of ProcClockOut. 
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8.1.7 ProcClockOut 

This clock is derived from the internal processor clock, which is in turn derived from Clockin. Its period is 
equal to one internal microcode cycle time, and can be derived from the formula 

TPCLPCL = TDCLDCL / PLLx 

where TPCLPCL is the ProcClockOut Period, TDCLDCL is the Clockin Period and PLLx is the phase 
lock loop factor for the relevant speed part, obtained from the ordering details (Ordering section). 

Edges of the various external memory strobes are synchronised by, but do not all coincide with, rising or 
falling edges of ProcClockOut. 

Table 8.2 ProcClockOut 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPCLPCL 
TPCHPCL 
TPCLPCH 
TPCstab 


ProcClockOut period 
ProcClockOut pulse width high 
ProcClockOut pulse width low 
ProcClockOut stability 


a-1 
b-2.5 


a 
b 
c 


a+1 
b+2.5 

4 


ns 
ns 
ns 
% 


1 
2 
3 
4 



Notes 



1 a is TDCLDCL/PLLx. 

2 b is 0.5*TPCLPCL (half the processor clock period). 

3 c is TPCLPCL-TPCHPCL. 

4 Stability is the variation of cycle periods between two consecutive cycles, measured at corresponding points on 
the cycles. 
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Figure 8.4 IMS T801 ProcClockOut timing 
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8.2 



Read cycle 



Read cycle data may be set up on the bus at any time after the start of T1, but must be valid when the 
IMS T801 reads it during T4. Data can be removed any time after the rising edge of notMemCE, but must 
be off the bus no later than the middle of T1, which allows for bus turn-around time before the data lines are 
driven at the start of T2 in a processor write cycle. 

Byte addressing is carried out internally by the IMS T801 for read cycles. 



Table 8.3 Read cycle 



SYMBOL 


PARAMETER 


T801-30 


T801-25 


T801-20 


NOTE 


MIN 


MAX 


MIN 


MAX 


MIN 


MAX 


TAVEL 


Address valid before chip enable low 


6 




8 




10 




1.3 


TELEH 


Chip enable low 


48 


53 


58 


64 


72 


78 


1.3 


TEHEL 


Delay before chip enable re-assertion 


14 




16 




20 




1,2,3 


TEHAX 


Address hold after chip enable high 


6 




8 




10 




1.3 


TELDrV 


Data valid from chip enable low 





34 





40 





47 


3 


TAVDrV 


Data valid from address valid 





40 





48 





57 


3 


TDrVEH 


Data setup before chip enable high 


14 




18 




25 




3 


TEHDrZ 


Data hold after chip enable high 





14 





16 





20 


3 


TWEHEL 


Write enable setup before chip enable 
low 


14 




16 




20 




3,4 


TPCHEL 


ProcClockOut high to chip enable low 


6 




8 




10 




1.3 



Notes 



1 This parameter is common to read and write cycles and to byte-wide memory accesses. 

2 These values assume back-to-back external memory accesses. 

3 Units are ns. 

4 Timing is for all four write enables notMemWrBO-3. 
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Figure 8.5 IMS T801 external read cycle 
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8.3 



Write cycle 



For write cycles the relevant bytes in memory are addressed by the write enables notMemWrBO-3. If a par- 
ticular byte is not to be written, then the corresponding data outputs are tristated. notMemWrBO addresses 
the least significant byte. 

The write enables are gated with the chip enable signal notMemCE, allowing them to be used without 
notMemCE for simple designs. 

Data may be strobed into memory using notMemWrBO-3 without the use of notMemCE, as the write enables 
go high between consecutive external memory write cycles. 

Write data is placed on the data bus at the start of T2 and removed at the end of T4. The write cycle is 
completed with notMemCE going high. 

Table 8.4 Write cycle 



SYMBOL 


PARAMETER 


T801.30 


T801-25 


T801-20 


NOTE 


MIN 


MAX 


MIN 


MAX 


MIN 


MAX 


TDwVEH 


Data setup before chip enable high 


33 




40 




50 




1 


TEHDwZ 


Data hold after write 


6 


10 


8 


12 


10 


15 


1 


TDwZEL 


Write data invalid to next chip enable 


6 




8 




10 




1 


TWELEL 


Write enable setup to chip 
enable low 


-1 





-2 





-3 





1,2 


TEHWEH 


Write enable hold after chip enable 
high 





1 





2 





3 


1,2 



Notes 



1 Units are ns. 

2 Timing is for all four write enables notMemWrBO-3. 
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Figure 8.6 IMS T801 external write cycle 
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8.4 



Wait 



Taking MemWait high with the timing shown in the diagram will extend the duration of T2 by one processor 
cycle TPCLPCL. One wait state comprises the pair W1 and W2. l\/lemWait is sampled during T2, and should 
not change state in this region. If MemWait is still high when sampled in W2 then another wait period will be 
inserted. This can continue indefinitely. Internal memory access is unaffected by the number of wait states 
selected. 

The wait state generator can be a simple digital delay line, synchronised to notMemCE. The Single Wait 
State Generator circuit in figure 8.7 can be extended to provide two or more wait states, as shown in figure 
8.8. 

Table 8.5 Memory wait 



SYMBOL 


PARAMETER 


T801-30 


T801-25 


T801-20 


NOTE 


MIN 


MAX 


MIN 


MAX 


MIN 


MAX 


TPCHWtH 

TAVWtH 

TPCHWtL 


MemWait asserted after 
ProcClockOut high 

MemWait asserted after 
Address valid 

Wait low after 
ProcClockOut high 


22 


16 
16 


28 


20 
20 


35 


25 
25 


1 
1 
1 



Notes 



1 Units are ns. 
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Figure 8.7 IMS T801 memory wait timing 
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Figure 8.7 Single wait state generator 



notMemCE — [>0- 



1/6 74F04 



C 



GND- 



ProcCiockOut 



S 

R 

D Q 

CP 



1/2 74F74 



R 

D Q 

CP 



1/2 74F74 



VCC 



-^ IVIemWait 



I 



Figure 8.8 Extendable wait state generator 
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8.5 



Direct memory access 



Direct memory access (DMA) can be requested at any time by driving the asynchronous [\/lemReq input 
high. IVIemReq is sampled during T1 of the processor cycle and the DMA device will then have control of 
the bus at the beginning of the next processor cycle, (after one ProcClockOut for internal accesses and 
two ProcClockOut cycles for external memory accesses, without wait states). When the processor transfers 
control of the bus the signals MemA2-31, notMemWrBO-3 and notMemCE are tristated and MemGranted 
is asserted high. MemGranted follows the timing of the bus being tristated and can be used to signal to 
the device requesting the DMA that it has control of the bus. Note that MemGranted changes on the falling 
edge of ProcClockOut and can therefore be sampled to establish control of the bus on the rising edge of 
ProcClockOut. During the DMA cycles, MemReq is sampled during each high phase of ProcClockOut and 
after it is taken low, control of the bus will be returned to the processor within two ProcClockOut cycles. 

The processor is still able to access its internal memory while the DMA transfer proceeds, however when an 
external memory request is made the processor is forced to wait until the end of the DMA request. The DMA 
device has no access to the transputer's internal memory 

While control of the bus is being transferred from the processor to the DMA device, an extra clock phase, 
(one quarter of a ProcClockOut cycle) is allowed before the DMA transfer begins to ensure that the not- 
MemCE and notMemWrBO-3 signals have been driven high before being tristated. This normally removes 
the requirement for external pull-up resistors. 

DMA allows a bootstrap program to be loaded into external memory for execution after reset. If MemReq 
is asserted high during reset, MemGranted will be asserted high allowing access to the external memory 
before the bootstrap sequence begins. MemReq must be asserted for at least one period of TDCLDCL of 
Clockln before Reset is asserted. The DMA control circuitry should ensure that correct operation will result 
if Reset should interrupt a normal DMA cycle. 
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Figure 8.9 IMS T801 DMA sequence at reset 
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Table 8.6 Memory request 



SYMBOL 


PARAMETER 


T801-30 


T801-25 


T801-20 


NOTE 


MIN 


MAX 


MIN 


MAX 


MIN 


MAX 


TMRHMGH 


Memory request response time 


58 


a 


70 


a 


85 


a 


1,2 


TMRLMGL 


Memory request end response 
time 


60 


66 


75 


80 


90 


100 


2 


TAZMGH 


Address bus tristate before 
MemGranted 

















2 


TDZMGH 


Data bus tristate before 
MemGranted 

















2 



Notes 



1 Maximum response time a depends on whether an external memory cycle is in progress. Maximum time is 
(2 processor cycles) + (number of wait state cycles) for word access. 

2 Units are ns. 
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Figure 8.10 IMS T801 memory request timing 
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9 Events 

EventReq and EventAck provide an asynchronous handshake interface between an external event and an 
internal process. When an external event takes EventReq high the external event channel (additional to the 
external link channels) is made ready to communicate with a process. When both the event channel and the 
process are ready the processor takes EventAck high and the process, if waiting, is scheduled. EventAck 
is removed after EventReq goes low. 

EventWaiting is asserted high by the transputer when a process executes an input on the event channel; 
typically with the occam event ? ANY instruction. It remains high whilst the transputer is waiting for or 
servicing EventReq and is returned low when EventAck goes high. The EventWaiting pin changes near the 
falling edge of ProcClockOut and can therefore be sampled by the rising edge of ProcClockOut. 

The EventWaiting pin can only be asserted by executing an in instruction on the event channel. The 
EventWaiting pin is not asserted high when an enable channel {enbc) instruction is executed on the Event 
channel (during an ALT construct in occam, for example). The EventWaiting pin can be asserted by executing 
the Occam input on the event channel (such as Event ? ANY), provided that this does not occur as a 
guard in an alternative process. The EventWaiting pin can not be used to signify that an alternative process 
(ALT) is waiting on an input from the event channel. 

EventWaiting allows a process to control external logic; for example, to clock a number of inputs into a 
memory mapped data latch so that the event request type can be determined. 

Only one process may use the event channel at any given time. If no process requires an event to occur 
EventAck will never be taken high. Although EventReq triggers the channel on a transition from low to high, 
it must not be removed before EventAck is high. EventReq should be low during Reset; if not it will be 
ignored until it has gone low and returned high. EventAck is taken low when Reset occurs. 

If the process is a high priority one and no other high priority process is running, the latency is as described 
on page 136. Setting a high priority task to wait for an event input allows the user to interrupt a transputer 
program running at low priority. The time taken from asserting EventReq to the execution of the microcode 
interrupt handler in the CPU is four cycles. The following functions take place during the four cycles: 

Cycle 1 Sample EventReq at pad on the rising edge of ProcClockOut and synchronise. 

Cycle 2 Edge detect the synchronised EventReq and form the interrupt request. 

Cycle 3 Sample interrupt vector for microcode ROM in the CPU. 

Cycle 4 Execute the interrupt routine for Event rather than the next instruction. 
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Table 9.1 Event 



SYIVIBOL 


PARAIVIETER 


IVIIN 


NOIVl 


IVIAX 


UNITS 


NOTE 


TVHKH 

TKHVL 

TVLKL 

TKLVH 

TKHEWL 

TKLEWH 


Event request response 

Event request hold 

Delay before removal of event acknowledge 

Delay before re-assertion of event request 

Event acknowledge to end of event waiting 

End of event acknowledge to event waiting 












a+5 


ns 
ns 
ns 
ns 
ns 
ns 


1 



Notes 



1 a is 3 processor cycles TPCLPCL. 
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Figure 9.1 IMS T801 event timing 
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10 



Links 



Four identical INMOS bi-directional serial links provide synchronized communication between processors 
and with the outside world. Each link comprises an input channel and output channel. A link between two 
transputers is implemented by connecting a link interface on one transputer to a link interface on the other 
transputer. Every byte of data sent on a link is acknowledged on the input of the same link, thus each signal 
line carries both data and control information. 

The quiescent state of a link output is low. Each data byte is transmitted as a high start bit followed by a one 
bit followed by eight data bits followed by a low stop bit. The least significant bit of data is transmitted first. 
After transmitting a data byte the sender waits for the acknowledge, which consists of a high start bit followed 
by a zero bit. The acknowledge signifies both that a process was able to receive the acknowledged data byte 
and that the receiving link is able to receive another byte. The sending link reschedules the sending process 
only after the acknowledge for the final byte of the message has been received. 

The IMS T801 links allow an acknowledge packet to be sent before the data packet has been fully received. 
This overlapped acknowledge technique is fully compatible with all other INMOS transputer links. 

The IMS T801 links support the standard INMOS communication speed of 10 Mbits/sec. In addition they 
can be used at 20 Mbits/sec for IMS T801-20 and IMS T801-25. Links are not synchronised with Clockln 
or ProcClockOut and are insensitive to their phases. Thus links from independently clocked systems may 
communicate, providing only that the clocks are nominally identical and within specification. 

Links are TTL compatible and Intended to be used in electrically quiet environments, between devices on a 
single printed circuit board or between two boards via a backplane. Direct connection may be made between 
devices separated by a distance of less than 300 millimetres. For longer distances a matched 100 Ohm 
transmission line should be used with series matching resistors RIVI. When this is done the line delay should 
be less than 0.4 bit time to ensure that the reflection returns before the next data bit is sent. 

Buffers may be used for very long transmissions. If so, their overall propagation delay should be stable within 
the skew tolerance of the link, although the absolute value of the delay is immaterial. 

Link speeds can be set by LinkSpeed. LinkSpeed allows Links 0, 1 , 2 or 3 to be set to 10 or 20 Mbits/sec. 
Table 10.1 shows uni-directional and bi-directional data rates in Kbytes/sec for each link speed. Data rates 
are quoted for a transputer using internal memory, and will be affected by a factor depending on the number 
of external memory accesses and the length of the external memory cycle. 

Table 10.1 Speed Settings for Transputer Links 



Link 
Special 


i\/lbits/sec 


Kbytes/sec 


Uni 


Bi 




1 


10 
20 


910 
1740 


1250 
2350 





H L, 
|Ack| 


H'H 12 3 4 5 6|7 L, 


1 Data 1 



Figure 10.1 IMS T801 link data and acknowledge packets 
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Table 10.2 Link 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TJQr 


LinkOut rise time 






20 


ns 


1 


TJQf 


LinkOut fall time 






10 


ns 


1 


TJDr 


Linkin rise time 






20 


ns 


1 


TJDf 


Linkin fall time 






20 


ns 


1 


TJQJD 


Buffered edge delay 









ns 




TJBskew 


Variation in TJQJD 20 Mbits/s 






3 


ns 


2 




lOMbits/s 






10 


ns 


2 


CLIZ 


Linkin capacitance @ f=1MHz 






7 


PF 


1 


CLL 


LinkOut load capacitance 






50 


PF 




RM 


Series resistor for 10017 transmission line 




56 




ohms 





Notes 



1 These parameters are sampled, but are not 100% tested. 

2 This is the variation in the total delay through buffers, transmission lines, differential receivers etc., caused by 
such things as short term variation in supply voltages and differences in delays for rising and falling edges. 
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Figure 10.2 IMS T801 link timing 





LinkOut 1 .5V - - ^ 


/ 


Latest TJQJD -*- 
Earliest TJQJD *- 




■< 


"*" 


Linkin 1.5V -^ 


^Z 





TJBskew -*- 





Figure 10.3 IMS T801 buffered link timing 
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Figure 10.4 IMS T801 Links directly connected 
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Figure 10.5 IMS T801 Links connected by transmission line 
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Figure 10.6 IMS T801 Links connected by buffers 
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11.1 DC electrical characteristics 



Table 1 1 .1 Absolute maximum ratings 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VCC 


DC supply voltage 





7.0 


V 


1,2,3 


VI, vo 


Voltage on input and output pins 


-0.5 


VCC+0.5 


V 


1,2,3 


II 


Input current 




±25 


mA 


4 


OSCT 


Output short circuit time (one pin) 




1 


s 


2 


TS 


Storage temperature 


-65 


150 


°C 


2 


TA 


Ambient temperature under bias 


-55 


125 


°C 


2 


PDmax 


Maximum allowable dissipation 




2 


W 





Notes 



1 AH voltages are with respect to GND. 

2 This is a stress rating only and functional operation of the device at these or any other conditions beyond those 
indicated in the operating sections of this specification is not implied. Stresses greater than those listed may 
cause permanent damage to the device. Exposure to absolute maximum rating conditions for extended periods 
may affect reliability. 

3 This device contains circuitry to protect the inputs against damage caused by high static voltages or electrical 
fields. However, it is advised that normal precautions be taken to avoid application of any voltage higher than the 
absolute maximum rated voltages to this high impedance circuit. Unused inputs should be tied to an appropriate 
logic level such as VCC or GND. 

4 The input current applies to any input or output pin and applies when the voltage on the pin is between GND 
and vcc. 



Table 1 1 .2 Operating conditions 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VCC 

VI, vo 

CL 
TA 


DC supply voltage 
Input or output voltage 
Load capacitance on any pin 
Operating temperature range 


4.75 





5.25 

VCC 

60 

70 


V 
V 
PF 
°C 


1 
1.2 

3 



Notes 



1 All voltages are with respect to GND. 

2 Excursions beyond the supplies are permitted but not recommended; see DC characteristics. 

3 Air flow rate 400 linear ft/min transverse air flow. 
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Table 1 1 .3 DC characteristics 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VIH 


High level input voltage 


2.0 


VCC+0.5 


V 


1.2 


VIL 


Low level Input voltage 


-0.5 


0.8 


V 


1,2 


II 


Input current @ GND<VI<VCC 




±10 


/iA 


1,2 


VOH 


Output high voltage @ IOH=2mA 


VCC-1 




V 


1,2 


VOL 


Output low voltage @ IOL=4mA 




0.4 


V 


1.2 


lOS 


Output short circuit current @ GND<VO<VCC 


36 


65 


mA 


1,2,3.6 






65 


100 


mA 


1 .2,4.6 


lOZ 


Tristate output current @ GND<VO<VCC 




±10 


mA 


1,2 


PD 


Power dissipation 




1.2 


w 


2.5 


CIN 


Input capacitance @ f=1 MHz 




7 


PF 


6 


COZ 


Output capacitance @ f=l MHz 




10 


PF 


6 



Notes 



1 All voltages are with respect to GND. 

2 Parameters for IMS T801-S measured at 4.75V<VCC<5.25V and 0°C<TA<70°C. 
Input clock frequency = 5 MHz. 

3 Current sourced from non-link outputs excluding ProcClockOut. 

4 Current sourced from link outputs and ProcClockOut. 

5 Power dissipation varies with output loading and program execution. 
Power dissipation for processor operating at 20 MHz. 

6 This parameter is sampled and not 100% tested. 



11.2 Equivalent circuits 



VCC 



Output 



R1 



-W- 



50pF dfs I |R2 
GND - 



Load for: 


R1 


R2 


Equivalent load: 


Link outputs 
Other outputs 


1K96 
970R 


47K 
24K 


1 Schottky TTL input 

2 Schottky TTL inputs 



Diodes are 1N916 



Figure 11.1 Load circuit for AC measurements 
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Inputs 



Vdd-1 



OV 
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OV 
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Figure 1 1 .2 AC measurements timing waveforms 
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Figure 1 1 .3 Tristate load circuit for AC measurements 
11.3 AC timing characteristics 

Table 1 1 .4 Input, output edges 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


TDr 
TDf 
TQr 
TQf 


Input rising edges 
Input falling edges 
Output rising edges 
Output falling edges 


2 
2 


20 
20 
25 
15 


ns 
ns 
ns 
ns 


1,2,3 

1,2,3 

1,4 

1,4 



Notes 



1 Non-link pins; see section on links. 

2 All inputs except Clockln; see section on Clockln. 

3 These parameters are not tested. 

4 These parameters are sampled, but are not 100% tested. 
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Figure 1 1 .4 IMS T801 input and output edge timing 
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Figure 1 1 .5 Typical rise/fall times 



Notes 



1 Skew is measured between notl\/lemCE with a standard load (2 Schottky TTL inputs and 30 pF) and notl\AemCE 
with a load of 2 Schottky TTL inputs and varying capacitance. 



11.4 Power rating 

Internal power dissipation P/jvr of transputer and peripheral chips depends on VCC, as shown in figure 1 1 .6. 
Pint is substantially Independent of temperature. 

Total power dissipation Pd of the chip is 

Pd = Pint + Pio 
where Pio is the power dissipation in the input and output pins; this is application dependent. 
Internal working temperature Tj of the chip is 

Tj^Ta^BJa^Pd 

where Ta is the external ambient temperature in °C and BJa is the junction-to-ambient thermal resistance in 
°C/W. BJa for each package is given in the Packaging Specifications section. 
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Figure 1 1 .6 IMS T801 internal power dissipation vs VCC 
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Figure 1 1 .7 IMS T801 typical power dissipation with processor speed 
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12 



Performance 



The performance of the transputer is measured in terms of the number of bytes required for the program, and 
the number of (internal) processor cycles required to execute the program. The figures here relate to OCCam 
programs. For the same function, other languages should achieve approximately the same performance as 
Occam. 

With transputers Incorporating an FPU, this type of performance calculation is straight fonA/ard when consider- 
ing only Integer data types. However, when floating point calculations using the REAL32 and REAL64 data 
types are present in the program, complications arise due to the concurrency Inherent in the transputer's de- 
sign whereby integer calculations can be overlapped with floating point calculations. A more comprehensive 
guide to the impact of this concurrency on transputer performance can be found in the Transputer Instruction 
Set - A Compiler Writers' Guide. 



12.1 



Performance overview 



These figures are averages obtained from detailed simulation, and should be used only as an Initial guide; 
they assume operands are of type INT. The abbreviations In table 12.1 are used to represent the quantities 
indicated. In the replicator section of the table, figures in braces {} are not necessary if the number cf 
replications is a compile time constant. To estimate performance, add together the time for the variable 
references and the time for the operation. 



Table 12.1 Key to performance table 



np 

ne 

r 

ts 

w 

P 

Eg 

Et 

Tb 

Tbp 

Tbc 

nsp 

nap 



number of component processes 

number of processes earlier in queue 

1 if INT parameter or array parameter, if not 

number of table entries (table size) 

width of constant in nibbles 

number of places to shift 

expression used in a guard 

timer expression used in a guard 

most significant bit set of multiplier ((-1) If the multiplier Is 0) 

most significant bit set in a positive multiplier when counting from zero ((-1) If the multiplier is 0) 

most significant bit set in the two's complement of a negative multiplier 

Number of scalar parameters in a procedure 

Number of array parameters in a procedure 
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Table 12.2 Performance 





Size (bytes) 


Time (cycles) 


Names 

variables 

in expression 
assigned to or input to 
in PROC or FUNCTION call, 
corresponding to an INT parameter 

channels 


1.1+r 
1.1 +r 

1.1+r 
1.1 


2.1+2(r) 
1.1 +(r) 

1.1 +(r) 
2.1 


Array Variables (for single dimension arrays) 
constant subscript 
variable subscript 
expression subscript 




5.3 
5.3 




7.3 
7.3 


Declarations 

CHAN OF protocol 
[size] CHAN OF protocol 
PROC 


3.1 

9.4 

body+2 


3.1 

2.2 + 20.2*size 




Primitives 

assignment 

input 

output 

STOP 

SKIP 



4 
1 
2 





26.5 

26 

25 




Arithmetic operators 
+ 

• 

/ 

REM 

» « 


1 
2 
2 
2 
2 


1 

39 
40 
38 
3+p 


Modulo Arithmetic operators 
PLUS 
MINUS 

TIMES (fast multiply, positive operand) 
TIMES (fast multiply, negative operand) 


2 

1 
1 
1 


2 

1 
4+Tbp 
5+Tbc 


Boolean operators 
OR 
AND NOT 


4 

1 


8 
2 


Comparison operators 
= constant 
= variable 
<> constant 
<> variable 
> < 
>= <= 



2 
1 
3 

1 
2 


1 
3 
3 
5 
2 
4 


Bit operators 

/\ \/ X - 


2 


2 


Expressions 

constant in expression 
check if error 


w 

4 


w 

6 
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Table 12.3 Performance 





Size (bytes) 


Time (cycles) 


Timers 






timer input 


2 


3 


timer AFTER 






if past time 


2 


4 


with empty timer queue 


2 


31 


non-empty timer queue 


2 


38+ne*9 


ALT (timer) 






witli empty timer queue 


6 


52 


non-empty timer queue 


6 


59+ne*9 


timer alt guard 


8+2Eg+2Et 


34+2Eg+2Et 


Constructs 






SEQ 








IF 


1.3 


1.4 


if guard 


3 


4.3 


ALT (non timer) 


6 


26 


alt channel guard 


10.2+2Eg 


20+2Eg 


skip alt guard 


8+2Eg 


10+2Eg 


PAR 


11.5+(np-1)*7.5 


19.5+(np-1)*30.5 


WHILE 


4 


12 


Procedure or function call 








3.5+(nsp-2)*1.1 


16.5+(nsp-2)*1.1 




+nap*2.3 


+nap*2.3 


Replicators 






replicated SEQ 


7.3{+5.1} 


(-3.8)+15.1*count{+7.1} 


replicated if 


12.3{+5.1} 


(-2.6)+19.4*count{+7.1} 


replicated ALT 


24.8{+10.2} 


25.4+33.4*count{+14.2} 


replicated timer ALT 


24.8{+10.2} 


62.4+33.4*C0unt{+14.2} 


replicated par 


39.1 {+5.1} 


(-6.4)+70.9*count{+7.1} 



12.2 Fast multiply, TIMES 

The IMS T801 has a fast integer multiplication instruction product. For a positive multiplier its execution time 
is 4+Tbp cycles, and for a negative multiplier 5+Tbc cycles (table 12.1). The time taken for a multiplication 
by zero is 3 cycles. 

Implementations of high level languages on the transputer may take advantage of this instruction. For example, 
the Occam modulo arithmetic operator times is implemented by the instruction and the right-hand operand is 
treated as the multiplier. The fast multiplication instruction is also used in high level language implementations 
for the multiplication implicit in multi-dimensional array access. 
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12.3 



Arithmetic 



A set of functions are provided within tine development system to support the efficient implementation of 
multiple length integer arithmetic. In the IMS T801, floating point arithmetic is taken care of by the FPU. In 
table 12.4 n gives the number of places shifted and all arguments and results are assumed to be local. Full 
details of these functions are provided in the Occam reference manual, supplied as part of the development 
system and available as a separate publication. 

When calculating the execution time of the predefined maths functions, no time needs to be added for calling 
overhead. These functions are compiled directly into special purpose instructions which are designed to 
support the efficient implementation of multiple length integer arithmetic and floating point arithmetic. 

Table 12.4 Arithmetic performance 









+ cycles for 


Function 




Cycles 


parameter access f 


LONGADD 




2 


7 


LONGSUM 




3 


8 


LONGSUB 




2 


7 


LONGDIFF 




3 


8 


LONGPROD 




34 


8 


LONGDIV 




36 


8 


SHIFTRIGHT 


(n<32) 


4+n 


8 




(n>=32) 


n-27 




SHIFTLEFT 


(n<32) 


4+n 


8 




(n>=32) 


n-27 




NORMALISE 


(n<32) 


n+6 


7 




(n>=32) 


n-25 






(n=64) 


4 




ASHIFTRIGHT 




SHIFTRIGHT+2 


5 


ASHIFTLEFT 




SHIFTT.F.FT+4 


5 


ROTATERIGHT 




SHIFTRIGHT 


7 


ROTATELEFT 




SHIFTLEFT 


7 


FRACMUL 




LONGPROD+4 


5 



t Assuming local variables. 
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12.4 Floating point operations 

All references to REAL32 or REAL64 operands within programs compiled for the IMS T801 normally produce 
the following performance figures. 

Table 12.5 Floating point performance 





Size (bytes) 


REAL32 Time (cycles) 


REAL64 Time (cycles) 


Names 








variables 








in expression 


3.1 


3 


5 


assigned to or input to 


3.1 


3 


5 


in PROC or FUNCTION call. 








corresponding to a REAL 








parameter 


1.1 +r 


1.1+r 


1.1+r 


Arithmetic operators 








+ 


2 


7 


7 


* 


2 


11 


20 


/ 


2 


17 


32 


REM 


11 


19 


34 


Comparison operators 








= 


2 


4 


4 


<> 


3 


6 


6 


> < 


2 


5 


5 


>= <= 


3 


7 


7 


Conversions 








REAL32 to - 


2 




3 


REAL64 to - 


2 


6 




To INT32 from - 


5 


9 


9 


To INT 6 4 from - 


18 


32 


32 


INT32 to - 


3 


7 


7 


INT64 to - 


14 


24 


22 



12.4.1 Floating point functions 

These functions are provided by the development system. They are compiled directly into special purpose 
instructions designed to support the efficient implementation of some of the common mathematical functions 
of other languages. The functions provide ABS and SQRT for both REAL32 and REAL 64 operand types. 

Table 12.6 IMS T801 floating point arithmetic performance 



Function 


Cycies 


+ cycles for parameter access f 


REAL32 


REAL64 


ABS 


2 


8 




SQRT 


118 


8 




DABS 


2 




12 


DSQRT 


244 




12 



t Assuming local variables. 
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12.4.2 Special purpose functions and procedures 

The functions and procedures given in tables 12.8 and 12.9 are provided by tine development system to give 
access to the special instructions available on the IMS T801. Table 12.7 shows the key to the table. 



Table 12.7 Key to special performance table 



Tb 
n 
r 
nr 



most significant bit set in the word counting from zero 
number of words per row (consecutive memory locations) 
number of rows in the two dimensional move 
number of bits to reverse 



Table 12.8 Special purpose functions performance 



Function 


Cycles 


+ cycles for 
parameter access f 


BITCOUNT 


2+Tb 


2 


CRCBYTE 


11 


8 


CRCWORD 


35 


8 


BITREVNBIT 


5+nr 


4 


BITREVWORD 


36 


2 



t Assuming local variables. 



Table 12.9 Special purpose procedures performance 



Procedure 


Cycles 


+ cycles for 
parameter access f 


MOVE 2D 
DRAW2D 
CLIP2D 


8+(2n+23)*r 
8+(2n+23)*r 
8+(2n+23)*r 


8 
8 
8 



t Assuming local variables. 



12.5 Effect of external memory 

Extra processor cycles may be needed when program and/or data are held in external memory, depending 
both on the operation being performed, and on the speed of the external memory. After a processor cycle 
which initiates a write to memory, the processor continues execution at full speed until at least the next 
memory access. 

Whilst a reasonable estimate may be made of the effect of external memory, the actual performance will 
depend upon the exact nature of the given sequence of operations. 

External memory is characterized by the number of extra processor cycles per external memory cycle, denoted 
as e. The value of e for the IMS T801 is greater than or equal to 1 . 

If a program is stored in external memory, and e has the value 2 or 3, then no extra cycles need be estimated 
for linear code sequences. For larger values of e, the number of extra cycles required for linear code 
sequences may be estimated at (e-3)/4. A transfer of control may be estimated as requiring e+3 cycles. 

These estimates may be refined for various constructs. In table 12.10 n denotes the number of components 
in a construct. In the case of IF, the n'th conditional is the first to evaluate to TRUE, and the costs include the 
costs of the conditionals tested. The number of bytes in an array assignment or communication is denoted 
byb. 
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Table 12.10 External memory performance 





IMS T801 1 


Program off chip 


Data off chip 


Boolean expressions 


e-2 





IF 


3en-8 


en 


Replicated IF 


(6e-4)n+7 


(5e-2)n+8 


Replicated SEQ 


(3e-3)n+2 


(4e-2)n 


PAR 


(3e-1)n+8 


3en+4 


Replicated par 


(10e-8)n+8 


16en-12 


ALT 


(2e-4)n+6e 


(2e-2)n+10e-8 


Array assignment and 





max (2e, e(b/2)) 


communication in 






one transputer 







The following simulation results illustrate the effect of storing program and/or data in external memory. The 
results are normalized to 1 for both program and data on chip. The first program (Sieve of Erastosthenes) 
is an extreme case as it is dominated by small, data access intensive loops; it contains no concurrency, 
communication, or even multiplication or division. The second program is the pipeline algorithm for Newton 
Raphson square root computation. 

Table 12.11 IMS T801 external memory performance 





Program 


e=2 


e=3 


e=4 


e=5 


On chip 


Program off chip 


1 
2 


1.3 
1.1 


1.5 
1.2 


1.7 
1.2 


1.9 
1.3 




Data off chip 


1 
2 


1.5 
1.2 


1.8 
1.4 


2.1 
1.6 


2.3 

1.7 




Program and data off chip 


1 
2 


1.8 
1.3 


2.2 

1.6 


2.7 
1.8 


3.2 
2.0 





12.6 Interrupt latency 

If the process is a high priority one and no other high priority process is running, the latency is as described 
in table 12.12. The timings given are in full processor cycles TPCLPCL; the number of Tm states is also 
given where relevant. Maximum latency assumes all memory accesses are internal ones. 



Table 12.12 Interrupt latency 





Typical 


Maximum 


TPCLPCL 


Tm 


TPCLPCL 


Tm 


IMS T801 with FPU in use 
IMS T801 with FPU not in use 


19 
19 


38 
38 


78 
58 


156 
116 
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13.1 100 pin grid array package 
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Figure 13.1 IMS T801 100 pin grid array package pinout - top view 
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Figure 13.2 100 pin grid array pacl<age dimensions 



Table 13.1 100 pin grid array package dimensions 



DilVI 


Millimetres 


Indies 


Notes 


NOM 


TOL 


NOM 


TOL 


A 


26.924 


±0.254 


1.060 


±0.010 




B1 


17.019 


±0.127 


0.670 


±0.005 




82 


18.796 


±0.127 


0.740 


±0.005 




C 


2.456 


±0.278 


0.097 


±0.011 




D 


4.572 


±0.127 


0.180 


±0.005 




E 


3.302 


±0.127 


0.130 


±0.005 




F 


0.457 


±0.051 


0.018 


±0.002 


Pin diameter 


G 


1.143 


±0.127 


0.045 


±0.005 


Flange diameter 


K 


22.860 


±0.127 


0.900 


±0.005 




L 


2.540 


±0.127 


0.100 


±0.005 




M 


0.508 




0.020 




Chamfer 



Table 13.2 100 pin grid array package junction to ambient thermal resistance 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


(9JA 


At 400 linear ft/min transverse air flow 






35 


°c/w 
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14 Ordering 

This section indicates the designation of speed and package selections for the various devices. Speed of 
Clockln is 5 MHz for ail parts. Transputer processor cycle time is nominal; it can be calculated more exactly 
using the phase lock loop factor PLLx, as detailed in the external memory section. 

For availability contact local INMOS sales office or authorised distributor. 



Table 14.1 IMS T801 ordering details 



INMOS 
designation 


Processor 
clock speed 


Processor 
cycle time 


PLLx 


Package 


IMS T801-G20S 
IMS T801-G25S 
IMS T801-G30S 

IMS T801-G20M 


20.0 MHz 
25.0 MHz 
30.0 MHz 

20.0 MHz 


50 ns 
40 ns 
33 ns 

50 ns 


4.0 
5.0 
6.0 

4.0 


Ceramic Pin Grid 
Ceramic Pin Grid 
Ceramic Pin Grid 

Ceramic Pin Grid 
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30 MIPS (peak) instruction rate 
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IEEE 754 
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120 Mbytes/sec sustained data rate to internal memory 
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Four INMOS serial links 5/10/20 Mbits/sec 

Bi-directional data rate of 2.4 Mbytes/sec per link 

High performance graphics support with block move 
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Single 5 MHz clock Input 

Single -h5V ±5% power supply 

MIL-STD-883C processing is available 

APPLICATIONS 
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High speed multi processor systems 
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1 Introduction 

The IMS T800 transputer is a 32 bit CMOS microcomputer with a 64 bit floating point unit and graphics support. 
It has 4 Kbytes on-chip RAM for high speed processing, a configurable memory interface and four standard 
INMOS communication links. The instruction set achieves efficient implementation of high level languages 
and provides direct support for the Occam model of concurrency when using either a single transputer or a 
network. Procedure calls, process switching and typical interrupt latency are sub-microsecond. 

For convenience of description, the IMS T800 operation is split into the basic blocks shown in figure 1.1. 



vcc 
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Figure 1.1 IMS T800 block diagram 



The processor speed of a device can be pin-selected in stages from 17.5 MHz up to the maximum allowed 
for the part. A device running at 30 MHz achieves an instruction throughput of 30 MIPS peak and 15 MIPS 
sustained. The extended temperature version of the device complies with MIL-STD-883C. 

The IMS T800 provides high performance arithmetic and floating point operations. The 64 bit floating point unit 
provides single and double length operation to the ANSI-IEEE 754-1985 standard for floating point arithmetic. 
It Is able to perform floating point operations concurrently with the processor, sustaining a rate of 2.2 Mflops 
at a processor speed of 20 MHz and 3.3 Mflops at 30 MHz. 
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High performance graphics support is provided by microcoded block move instructions which operate at the 
speed of memory. The two-dimensional block move instructions provide for contiguous block moves as well 
as block copying of either non-zero bytes of data only or zero bytes only. Block move instructions can be used 
to provide graphics operations such as text manipulation, windowing, panning, scrolling and screen updating. 

Cyclic redundancy checking (CRC) instructions are available for use on arbitrary length serial data streams, 
to provide error detection where data integrity is critical. Another feature of the IMS T800, useful for pattern 
recognition, is the facility to count bits set in a word. 

The IMS T800 can directly access a linear address space of 4 Gbytes. The 32 bit wide memory interface 
uses multiplexed data and address lines and provides a data rate of up to 4 bytes every 100 nanoseconds 
(40 Mbytes/sec) for a 30 MHz device. A configurable memory controller provides all timing, control and DRAM 
refresh signals for a wide variety of mixed memory systems. 

System Services include processor reset and bootstrap control, together with facilities for error analysis. Error 
signals may be daisy-chained in multi-transputer systems. 

The standard INMOS communication links allow networks of transputer family products to be constructed by 
direct point to point connections with no external logic. The IMS T800 links support the standard operating 
speed of 10 Mbits/sec, but also operate at 5 or 20 Mbits/sec. Each link can transfer data bi-directionally at 
up to 2.35 Mbytes/sec. 

The transputer is designed to implement the Occam language, detailed In the OCCam Reference Manual, but 
also efficiently supports other languages such as C, Pascal and Fortran. Access to the transputer at machine 
level is seldom required, but if necessary refer to the Transputer Instruction Set - A Connpiler Writers' Guide. 

This data sheet supplies hardware implementation and characterisation details for the IMS T800. It is intended 
to be read in conjunction with the Transputer Architecture chapter, which details the architecture of the 
transputer and gives an overview of OCCam. 
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Pin designations 



Table 2.1 IMS T800 system services 



Pin 


In/Out 


Function 


VCC, GND 




Power supply and return 


CapPlus, CapMinus 




External capacitor for internal clock power supply 


Clockln 


in 


Input clock 


ProcSpeedSelectO-2 


in 


Processor speed selectors 


Reset 


in 


System reset 


Error 


out 


Error indicator 


Errorin 


in 


Error daisychain input 


Analyse 


in 


Error analysis 


BootFromRom 


in 


Boot from external ROM or from link 


DisableintRAM 


in 


Disable internal RAM 


DoNotWire 




Must not be wired 



Table 2.2 IMS T800 external memory interface 



Pin 


In/Out 


Function 


ProcClockOut 


out 


Processor clock 


MemnotWrDO 


in/out 


Multiplexed data bit and write cycle warning 


MemnotRfDI 


in/out 


Multiplexed data bit 1 and refresh warning 


MemAD2-31 


in/out 


Multiplexed data and address bus 


notMemRd 


out 


Read strobe 


notMemWrBO-3 


out 


Four byte-addressing write strobes 


notMemSO-4 


out 


Five general purpose strobes 


notMemRf 


out 


Dynamic memory refresh indicator 


MemWalt 


in 


Memory cycle extender 


MemReq 


in 


Direct memory access request 


MemG ranted 


out 


Direct memory access granted 


MemConfIg 


in 


Memory configuration data input 



Table 2.3 IMS T800 event 



Pin 


In/Out 


Function 


EventReq 
EventAck 


in 
out 


Event request 

Event request acknowledge 



Table 2.4 IMS T800 link 



Pin 


In/Out 


Function 


LlnklnO-3 


in 


Four serial data input channels 


LlnkOutO-3 


out 


Four serial data output channels 


LinkSpeclal 


in 


Select non-standard speed as 5 or 20 Mbits/sec 


LInkOSpeclal 


in 


Select special speed for Link 


Link123Speclal 


in 


Select special speed for Links 1 ,2,3 



Signal names are prefixed by not if they are active low, otherwise they are active high. 
Pinout details for various packages are given on page 256. 
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3 Processor 

The 32 bit processor contains instruction processing logic, instruction and work pointers, and an operand 
register. It directly accesses the high speed 4 Kbyte on-chip memory, which can store data or program. 
Where larger amounts of memory or programs in ROM are required, the processor has access to 4 Gbytes 
of memory via the External Memory Interface (EMI). 



3.1 



Registers 



The design of the transputer processor exploits the availability of fast on-chip memory by having only a small 
number of registers; six registers are used in the execution of a sequential process. The small number of 
registers, together with the simplicity of the instruction set, enables the processor to have relatively simple 
(and fast) data-paths and control logic. The six registers are: 

The workspace pointer which points to an area of store where local variables are kept. 

The instruction pointer which points to the next instruction to be executed. 

The operand register which is used in the formation of instruction operands. 

The A, B and C registers which form an evaluation stack. 

A, B and C are sources and destinations for most arithmetic and logical operations. Loading a value into the 
stack pushes B into C, and A into B, before loading A. Storing a value from A, pops B into A and C into B. 

Expressions are evaluated on the evaluation stack, and instructions refer to the stack implicitly. For example, 
the add instruction adds the top two values in the stack and places the result on the top of the stack. The use of 
a stack removes the need for instructions to respecify the location of their operands. Statistics gathered from a 
large number of programs show that three registers provide an effective balance between code compactness 
and implementation complexity. 

No hardware mechanism is provided to detect that more than three values have been loaded onto the stack. 
It is easy for the compiler to ensure that this never happens. 

Any location in memory can be accessed relative to the workpointer register, enabling the workspace to be 
of any size. 

Further register details are given in Transputer Instruction Set - A Compiler Writers' Guide. 



Registers 




Locals 




Program 


A 








B 






C 






Workspace 








Next Inst 












Operand 





Figure 3.1 Registers 
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3.2 



Instructions 



The Instruction set has been designed for simple and efficient compilation of high-level languages. All in- 
structions have the same format, designed to give a compact representation of the operations occurring most 
frequently in programs. 

Each instruction consists of a single byte divided into two 4-bit parts. The four most significant bits of the byte 
are a function code and the four least significant bits are a data value. 





Function Data 


7 4 3^ 





Operand Register 





Figure 3.2 Instruction format 



3.2.1 



Direct functions 



The representation provides for sixteen functions, each with a data value ranging from to 15. Ten of these, 
shown In table 3.1, are used to encode the most important functions. 

Table 3.1 Direct functions 



load constant 


add constant 




load local 


store local 


load local pointer 


load non-local 


store non-local 




jump 


conditional jump 


call 



The most common operations in a program are the loading of small literal values and the loading and storing 
of one of a small number of variables. The load constant instruction enables values between and 15 to be 
loaded with a single byte instruction. The load local and store /oca/ instructions access locations in memory 
relative to the workspace pointer. The first 16 locations can be accessed using a single byte instruction. 

The load non-local and store non-local instructions behave similarly, except that they access locations in 
memory relative to the A register. Compact sequences of these instructions allow efficient access to data 
structures, and provide for simple implementations of the static links or displays used in the implementation 
of high level programming languages such as Occam, C, Fortran, Pascal or ADA. 

3.2.2 Prefix functions 



Two more function codes allow the operand of any instruction to be extended in length; prefix and negative 
prefix. 

All instructions are executed by loading the four data bits Into the least significant four bits of the operand 
register, which is then used as the instruction's operand. All instructions except the prefix instructions end by 
clearing the operand register, ready for the next instruction. 

The prefix instruction loads its four data bits into the operand register and then shifts the operand register up 
four places. The negative prefix instruction is similar, except that it complements the operand register before 
shifting it up. Consequently operands can be extended to any length up to the length of the operand register 
by a sequence of prefix Instructions. In particular, operands in the range -256 to 255 can be represented 
using one prefix instruction. 
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The use of prefix Instructions has certain beneficial consequences. Firstly, they are decoded and executed 
in the same way as every other instruction, which simplifies and speeds instruction decoding. Secondly, they 
simplify language compilation by providing a completely uniform way of allowing any instruction to take an 
operand of any size. Thirdly, they allow operands to be represented in a form independent of the processor 
wordlength. 

3.2.3 Indirect functions 

The remaining function code, operate, causes its operand to be interpreted as an operation on the values 
held in the evaluation stack. This allows up to 16 such operations to be encoded in a single byte instruction. 
However, the prefix instructions can be used to extend the operand of an operate instruction just like any 
other. The Instruction representation therefore provides for an indefinite number of operations. 

Encoding of the indirect functions is chosen so that the most frequently occurring operations are represented 
without the use of a prefix instruction. These Include arithmetic, logical and comparison operations such as 
add, exclusive or and greater than. Less frequently occurring operations have encodings which require a 
single prefix operation. 

3.2.4 Expression evaluation 

Evaluation of expressions sometimes requires use of temporary variables in the workspace, but the number 
of these can be minimised by careful choice of the evaluation order. 

Table 3.2 Expression evaluation 



Program 


Mnemonic 


X :=0 


Idc 
stl X 


X := #24 


pfix 2 
Idc 4 
stl X 


X := y + z 


Idl y 
Idl z 
add 
stl X 



3.2.5 Efficiency of encoding 

Measurements show that about 70% of executed instructions are encoded in a single byte; that is, without 
the use of prefix Instructions. Many of these instructions, such as load constant and add require just one 
processor cycle. 

The instruction representation gives a more compact representation of high level language programs than 
more conventional Instruction sets. Since a program requires less store to represent it, less of the memory 
bandwidth is taken up with fetching instructions. Furthermore, as memory is word accessed the processor 
will receive four instructions for every fetch. 

Short instructions also improve the effectiveness of Instruction pre-fetch, which in turn improves processor 
performance. There is an extra word of pre-fetch buffer, so the processor rarely has to wait for an instruction 
fetch before proceeding. Since the buffer Is short, there is little time penalty when a jump Instruction causes 
the buffer contents to be discarded. 
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3.3 



Processes and concurrency 



A process starts, performs a number of actions, and then either stops without completing or terminates 
complete. Typically, a process is a sequence of instructions. A transputer can run several processes in 
parallel (concurrently). Processes may be assigned either high or low priority, and there may be any number 
of each (page 197). 

The processor has a microcoded scheduler which enables any number of concurrent processes to be exe- 
cuted together, sharing the processor time. This removes the need for a software kernel. 



At any time, a concurrent process may be 



Active 



Inactive 



Being executed. 

On a list waiting to be executed. 

Ready to input. 

Ready to output. 

Waiting until a specified time. 



The scheduler operates in such a way that inactive processes do not consume any processor time. It allocates 
a portion of the processor's time to each process in turn. Active processes waiting to be executed are held 
in two linked lists of process workspaces, one of high priority processes and one of low priority processes 
(page 197). Each list is implemented using two registers, one of which points to the first process in the list, 
the other to the last. In the Linked Process List figure 3.3, process S is executing and P, Q and R are active, 
awaiting execution. Only the low priority process queue registers are shown; the high priority process ones 
perform in a similar manner. 
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Figure 3.3 Linked process list 



Table 3.3 Priority queue control registers 



Function 


High Priority 


Low Priority 


Pointer to front of active process list 
Pointer to back of active process list 


FptrO 
BptrO 


Fptn 
Bptn 



Each process runs until it has completed its action, but is descheduled whilst waiting for communication from 
another process or transputer, or for a time delay to complete. In order for several processes to operate in 
parallel, a low priority process is only permitted to run for a maximum of two time slices before it is forcibly 
descheduled at the next descheduling point (page 201). The time slice period is 5120 cycles of the external 
5 MHz clock, giving ticks approximately 1 ms apart. 
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A process can only be descheduled on certain instructions, known as descheduling points (page 201). As a 
result, an expression evaluation can be guaranteed to execute without the process being timesliced part way 
through. 

Whenever a process is unable to proceed, its instruction pointer is saved in the process workspace and 
the next process taken from the list. Process scheduling pointers are updated by instructions which cause 
scheduling operations, and should not be altered directly. Actual process switch times are less than 1 /iS, as 
little state needs to be saved and it is not necessary to save the evaluation stack on rescheduling. 

The processor provides a number of special operations to support the process model, including start process 
and end process. When a main process executes a parallel construct, start process instructions are used 
to create the necessary additional concurrent processes. A start process instruction creates a new process 
by adding a new workspace to the end of the scheduling list, enabling the new concurrent process to be 
executed together with the ones already being executed. When a process is made active it is always added 
to the end of the list, and thus cannot pre-empt processes already on the same list. 

The correct termination of a parallel construct is assured by use of the end process instruction. This uses 
a workspace location as a counter of the parallel construct components which have still to terminate. The 
counter is initialised to the number of components before the processes are started. Each component ends 
with an end process instruction which decrements and tests the counter. For all but the last component, the 
counter is non zero and the component is descheduled. For the last component, the counter is zero and the 
main process continues. 

3.4 Priority 

The IMS T800 supports two levels of priority. Priority 1 (low priority) processes are executed whenever there 
are no active priority (high priority) processes. 

High priority processes are expected to execute for a short time. If one or more high priority processes are 
able to proceed, then one is selected and runs until it has to wait for a communication, a timer input, or until 
it completes processing. 

If no process at high priority is able to proceed, but one or more processes at low priority are able to proceed, 
then one is selected. 

Low priority processes are periodically timesliced to provide an even distribution of processor time between 
computationally intensive tasks. 

If there are n low priority processes, then the maximum latency from the time at which a low priority process 
becomes active to the time when it starts processing is 2n-2 timeslice periods. It is then able to execute for 
between one and two timeslice periods, less any time taken by high priority processes. This assumes that 
no process monopolises the transputer's time; i.e. it has a distribution of descheduling points (page 201). 

Each timeslice period lasts for 5120 cycles of the external 5 MHz input clock (approximately 1 ms at the 
standard frequency of 5 MHz). 

If a high priority process is waiting for an external channel to become ready, and if no other high priority 
process is active, then the Interrupt latency (from when the channel becomes ready to when the process 
starts executing) is typically 19 processor cycles, a maximum of 78 cycles (assuming use of on-chip RAM). 
If the floating point unit is not being used at the time then the maximum interrupt latency is only 58 cycles. 
To ensure this latency, certain instructions are interruptable. 

3.5 Communications 

Communication between processes is achieved by means of channels. Process communication is point-to- 
point, synchronised and unbuffered. As a result, a channel needs no process queue, no message queue and 
no message buffer. 
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A channel between two processes executing on the same transputer is implemented by a single word in 
memory; a channel between processes executing on different transputers is implemented by point-to-point 
links. The processor provides a number of operations to support message passing, the most Important being 
input message and output message. 

The input message and ouput message instructions use the address of the channel to determine whether 
the channel is internal or external. Thus the same instruction sequence can be used for both, allowing a 
process to be written and compiled without knowledge of where its channels are connected. 

The process which first becomes ready must wait until the second one Is also ready. A process performs an 
Input or output by loading the evaluation stack with a pointer to a message, the address of a channel, and 
a count of the number of bytes to be transferred, and then executing an input message or output message 
Instruction. Data is transferred if the other process Is ready. If the channel is not ready or is an external one 
the process will deschedule. 

3.6 Bioclc move 

The block move on the transputer moves any number of bytes from any byte boundary In memory, to any 
other byte boundary, using the smallest possible number of word read, and word or part-word writes. 

A block move instruction can be Interrupted by a high priority process. On interrupt, block move is completed 
to a word boundary. Independent of start position. When restarting after interrupt, the last word written Is 
written again. This appears as an unnecessary read and write in the simplest case of word aligned block 
moves, and may cause problems with FIFOs. This problem can be overcome by incrementing the saved 
destination (BreglntSaveLoc) and source pointer (CreglntSaveLoc) values by BytesPerWord during the high 
priority process. 

3.7 Timers 

The transputer has two 32 bit timer clocks which 'tick' periodically. The timers provide accurate process 
timing, allowing processes to deschedule themselves until a specific time. 

One timer is accessible only to high priority processes and is Incremented every microsecond, cycling com- 
pletely in approximately 4295 seconds. The other is accessible only to low priority processes and is incre- 
mented every 64 microseconds, giving exactly 15625 ticks in one second. It has a full period of approximately 
76 hours. 

Table 3.4 Timer registers 



Clocl<0 
Clocl<1 
TNextRegO 
TNextRegl 



Current value of high priority (level 0) process clock 

Current value of low priority (level 1) process clock 

Indicates time of earliest event on high priority (level 0) timer queue 

Indicates time of earliest event on low priority (level 1) timer queue 



The current value of the processor clock can be read by executing a load timer Instruction. A process can 
arrange to perform a timer input, in which case it will become ready to execute after a specified time has 
been reached. The //mer /nptyf instruction requires a time to be specified. If this time is in the 'past' then the 
instruction has no effect. If the time Is In the 'future' then the process is descheduled. When the specified 
time is reached the process is scheduled again. 



3 Processor 



199 



Figure 3.4 shows two processes waiting on the timer queue, one 


waiting for time 21 , the other for time 31 . 
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Figure 3.4 Timer registers 
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4 Instruction set summary 

The Function Codes table 4.8. gives the basic function code set (page 194). Where the operand is less 
than 16, a single byte encodes the complete instruction. If the operand is greater than 15, one prefix 
instruction (pfix) is required for each additional four bits of the operand. If the operand is negative the first 
prefix instruction will be nfix. 

Table 4.1 pre//x coding 







Function 


IVIemory 


Mnemonic 


code 


code 


Idc 


#3 


#4 


#43 


Idc 


#35 






is coded as 








pfix 


#3 


#2 


#23 


Idc 


#5 


#4 


#45 


Idc 


#987 






is coded as 








pfix 


#9 


#2 


#29 


pfix 


#8 


#2 


#28 


Idc 


#7 


#4 


#47 


Idc 


-31 {Idc 


#FFFFFFE1) 




is coded as 








nfix 


#1 


#6 


#61 


Idc 


#1 


#4 


#41 



Tables 4.9 to 4.27 give details of the operation codes. Where an operation code is less than 16 (e.g. add: 
operation code 05), the operation can be stored as a single byte comprising the operate function code F and 
the operand (5 in the example). Where an operation code is greater than 15 (e.g. ladd: operation code 16), 
the pref/x function code 2 is used to extend the instruction. 



Table 4.2 operate coding 



IVInemonic 



Function 
code 



IVIemory 
code 



add 
is coded as 

opr 

ladd 
is coded as 

pfix 
opr 



(op. C0d^#5) 

add ^ 

(op. code #1 6) 

#1 
#6 



#F 



#2 
#F 



#F5 

#F5 

#21 F6 

#21 
#F6 



In the Floating Point Operation Codes tables 4.21 to 4.27, a selector sequence code (page 209) is indicated 
in the Memory Code column by s. The code given in the Operation Code column is the indirection code, the 
operand for the Idc instruction. 

The FPU and processor operate concurrently, so the actual throughput of floating point instructions is better 
than that implied by simply adding up the instruction times. For full details see Transputer Instruction Set - A 
Compiler Writers' Guide. 
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The Processor Cycles column refers to the number of periods TPCLPCL taken by an instruction executing 
in internal memory. The number of cycles is given for the basic operation only; where the memory code 
for an instruction is two bytes, the time for the prefix function (one cycle) should be added. For a 20 MHz 
transputer one cycle is 50 ns. Some instruction times vary. Where a letter is included in the cycles column it 
is interpreted from table 4.3. 



Table 4.3 Instruction set interpretation 



ident 



interpretation 



b 
m 

n 
w 



Bit number of the highest bit set in register A. Bit is the least significant bit. 

Bit number of the highest bit set in the absolute value of register A. 
Bit is the least significant bit. 

Number of places shifted. 

Number of words in the message. Part words are counted as full words. If the message 
is not word aligned the number of words is increased to include the part words at either 
end of the message. 

Number of words per row. 

Number of rows. 



The DE column of the tables indicates the descheduling/error features of an instruction as described in 
table 4.4. 

Table 4.4 Instruction features 



ident 


Feature 


See page: 


D 

E 
F 


The instruction is a descheduling point 
The instruction will affect the Error flag 
The instruction will affect the FP.Error flag 


201 
202,216 
209, 202 



4.1 



Descheduiing points 



The instructions in table 4.5 are the only ones at which a process may be descheduled (page 196). They are 
also the ones at which the processor will halt if the Analyse pin is asserted (page 215). 



Table 4.5 Descheduling point instructions 



input message 
timer alt wait 
jump 



output message 
timer input 
loop end 



output byte 
stop on error 
end process 



output word 
alt wait 
stop process 
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4.2 



Error instructions 



The instructions in table 4.6 are tlie only ones which can affect the Error flag (page 21 6) directly. Note, 
however, that the floating point unit error flag FP^Errorls set by certain floating point instructions (page 202), 
and that Error can be set from this flag by fpcheckerror. 



Table 4.6 Error setting instructions 



add 
multiply 
long add 
set error 
check word 



add constant 

fractional multiply 

long subtract 

testerr 

check subscript from 



subtract 
divide 
long divide 
fpcheckerror 
check single 



remainder 

check count from 1 



4.3 



Floating point errors 



The instructions in table 4.7 are the only ones which can affect the floating point error flag FP_Error (page 209). 
Error is set from this flag by fpcheckerror if FP.Error is set. 



Table 4.7 Floating point error setting instructions 



fpadd 


fpsub 


fpmul 


fpdiv 


fpldnladdsn 


fpldnladddb 


fpldnlmulsn 


fpldnlmuldb 


fpremfirst 


fpusqrtfirst 


fpgt 


fpeq 


fpuseterror 


fpuclearerror 


fptesterror 




fpuexplncby32 


fpuexpdecby32 


fpumulby2 


fpudivby2 


fpur32tor64 


fpur64tor32 


fpucki32 


fpucki64 


fprtoi32 


fpuabs 


fpint 
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Table 4.8 IMS T800 function codes 



Function 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 



1 
2 
3 


OX 
1X 
2X 
3X 


J 

Idip 
pfix 
Idnl 


3 
1 
1 
2 


jump 

load local pointer 

prefix 

load non-local 


D 


4 


4X 


Idc 


1 


load constant 




5 
6 

7 


5X 
6X 
7X 


Idnip 

nfix 

Idl 


1 
1 
2 


load non-local pointer 
negative prefix 
load local 




8 


8X 


adc 


1 


add constant 


E 


9 


9X 


call 


7 


call 




A 

B 
C 
D 


AX 

BX 
OX 

DX 


cj 

ajw 
eqc 

sti 


2 

4 
1 
2 

1 


conditional jump (not taken) 
conditional jump (taken) 
adjust workspace 
equals constant 
store local 




E 


EX 


stnl 


2 


store non-local 




F 


FX 


opr 


- 


operate 





Table 4.9 IMS T800 arithmetic/logical operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


46 


24F6 


and 


1 


and 




4B 


24FB 


or 


1 


or 




33 


23F3 


xor 


1 


exclusive or 




32 


23F2 


not 


1 


bitwise not 




41 


24F1 


shI 


n+2 


shift left 




40 


24F0 


shr 


n+2 


shift right 




05 


F5 


add 


1 


add 


E 


OC 


FC 


sub 


1 


subtract 


E 


53 


25F3 


mul 


38 


multiply 


E 


72 


27F2 


fmul 


35 


fractional multiply (no rounding) 


E 








40 


fractional multiply (rounding) 


E 


20 


22FC 


div 


39 


divide 


E 


1F 


21 FF 


rem 


37 


remainder 


E 


09 


F9 


gt 


2 


greater than 




04 


F4 


diff 


1 


difference 




52 


25F2 


sum 


1 


sum 




08 


F8 


prod 


b+4 
m-i-5 


product for positive register A 
product for negative register A 
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Table 4.10 IMS T800 long arithmetic operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


16 


21 F6 


ladd 


2 


long add 


E 


38 


23F8 


Isub 


2 


long subtract 


E 


37 


23F7 


Isum 


3 


long sum 




4F 


24FF 


Idiff 


3 


long diff 




31 


23F1 


Imul 


33 


long multiply 




1A 


21 FA 


Idiv 


35 


long divide 


E 


36 


23F6 


IshI 


n+3 
n-28 


long shift left (n<32) 
long shift left(n>32) 




35 


23F5 


Ishr 


n+3 
n-28 


long shift right (n<32) 
long shift right (n>32) 




19 


21 F9 


norm 


n+5 
n-26 

3 


normalise (n<32) 
normalise (n>32) 
normalise (n=64) 





Table 4.1 1 IMS T800 general operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


00 


FO 


rev 


1 


reverse 




3A 


23FA 


xword 


4 


extend to word 




56 


25F6 


cword 


5 


check word 


E 


ID 


21 FD 


xdble 


2 


extend to double 




40 


24FC 


csngi 


3 


check single 


E 


42 


24F2 


mint 


1 


minimum integer 




5A 


25FA 


dup 


1 


duplicate top of stack 





Table 4.12 IMS T800 2D block move operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


5B 


25FB 


move2dinit 


8 


initialise data for 2D block move 




50 


25FO 


move2dall 


(2p+23)*r 


2D block copy 




5D 


25FD 


move2dnonzero 


(2p+23)*r 


2D block copy non-zero bytes 




5E 


25FE 


move2dzero 


(2p+23)*r 


2D block copy zero bytes 





Table 4.13 IMS T800 ORG and bit operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


74 
75 

76 
77 
78 


27F4 
27F5 

27F6 
2/h/ 
27F8 


crcword 
crcbyte 

bitcnt 

bitrevword 

bitrevnbits 


35 

11 

b+2 
36 
n+4 


calculate crc on word 
calculate crc on byte 

count bits set in word 
reverse bits in word 
reverse bottom n bits in word 
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Table 4.14 IMS T800 indexing/array operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


02 


F2 


bsub 


1 


byte subscript 




OA 


FA 


wsub 


2 


word subscript 




81 


28F1 


wsubdb 


3 


form double word subscript 




34 


23F4 


bent 


2 


byte count 




3F 


23FF 


went 


5 


word count 




01 


F1 


lb 


5 


load byte 




SB 


23FB 


sb 


4 


store byte 




4A 


24FA 


move 


2w+8 


move message 





Table 4.15 IMS T800 timer handling operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 
E 


22 


22F2 


Idtimer 


2 


load timer 




2B 


22FB 


tin 


30 


timer input (time future) 


D 








4 


timer input (time past) 


D 


4E 


24FE 


talt 


4 


timer alt start 




51 


25F1 


taltwt 


15 


timer alt wait (time past) 


D 








48 


timer alt wait (time future) 


D 


47 


24F7 


enbt 


8 


enable timer 




2E 


22FE 


dist 


23 


disable timer 





Table 4.16 IMS T800 input/output operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


07 
OB 
OF 
OE 


F7 
FB 
FF 
FE 


in 

out 

outword 
outbyte 


2W+19 
2W+19 

23 

23 


input message 
output message 
output word 
output byte 


D 
D 
D 
D 


43 

44 

45 


24F3 
24F4 

24F5 


alt 
altwt 

altend 


2 
5 

17 
4 


alt start 

alt wait (channel ready) 
alt wait (channel not ready) 
alt end 


D 
D 


49 
30 


24F9 
23F0 


enbs 
diss 


3 

4 


enable skip 
disable skip 




12 
48 

2F 


21 F2 
24F8 

22FF 


resetch 
enbc 

disc 


3 
7 
5 
8 


reset channel 
enable channel (ready) 
enable channel (not ready) 
disable channel 
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Table 4.17 IMS T800 control operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


20 
1B 
3C 
06 
21 


22F0 
21 FB 
23FC 

F6 
22F1 


ret 

Idpi 

gajw 

gcall 

lend 


5 
2 
2 

4 
10 
5 


return 

load pointer to instruction 

general adjust workspace 

general call 

loop end (loop) 

loop end (exit) 


D 
D 



Table 4.18 IMS T800 scheduling operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


OD 


FD 


startp 


12 


start process 


D 


03 


F3 


endp 


13 


end process 


D 


39 


23F9 


runp 


10 


run process 




15 


21 F5 


stopp 


11 


stop process 




IE 


21 FE 


Idpri 


1 


load current priority 





Table 4.19 IMS T800 error handling operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


13 
4D 


21 F3 
24FD 


csubO 
ccnti 


2 
3 


check subscript from 
check count from 1 


E 
E 


29 
10 


22F9 
21 FO 


testerr 
seterr 


2 
3 

1 


test error false and clear (no error) 
test error false and clear (error) 
set error 


E 


55 
57 


25F5 
25F7 


stoperr 
cirhalterr 


2 

1 


stop on error (no error) 
clear halt-on-error 


D 


58 


25F8 


sethalterr 


1 


set halt-on-error 




59 


25F9 


testhalterr 


2 


test halt-on-error 





Table 4.20 IMS T800 processor initialisation operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


2A 


22FA 


testpranal 


2 


test processor analysing 




3E 


23FE 


saveh 


4 


save high priority queue registers 




3D 


23FD 


savel 


4 


save low priority queue registers 




18 


21 F8 


sthf 




store high priority front pointer 




50 


25F0 


sthb 




store high priority back pointer 




10 


21 FC 


stif 




store low priority front pointer 




17 


21 F7 


stib 




store low priority back pointer 




54 


25F4 


sttimer 




store timer 





4 Instruction set summary 
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Table 4.21 IMS T800 floating point load/store operation codes 



Operation 


l\/lemory 




Processor 




D 


Code 


Code 


IVInemonic 


Cycles 


Name 


E 


8E 


28FE 


fpldnlsn 


2 


fp load non-local single 




8A 


28FA 


fpldnldb 


3 


fp load non-local double 




86 


28F6 


fpldnlsni 


4 


fp load non-local indexed single 




82 


28F2 


fpldnldbi 


6 


fp load non-local indexed double 




9F 


29FF 


fpldzerosn 


2 


load zero single 




AO 


2AF0 


fpldzerodb 


2 


load zero double 




AA 


2AFA 


fpldnladdsn 


8/11 


fp load non local & add single 


F 


A6 


2AF6 


fpldnladddb 


9/12 


fp load non local & add double 


F 


AC 


2AFC 


fpldnlmulsn 


13/20 


fp load non local & multiply single 


F 


A8 


2AF8 


fpldnlmuldb 


21/30 


fp load non local & multiply double 


F 


88 


28F8 


fpstnlsn 


2 


fp store non-local single 




84 


28F4 


fpstnldb 


3 


fp store non-local double 




9E 


29FE 


fpstnli32 


4 


store non-local int32 





Processor cycles are shown as Typical/Maximum cycles. 



Table 4.22 IMS T800 floating point general operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


AB 
A4 
A3 


2AFB 
2AF4 
2AF3 


fpentry 

fprev 

fpdup 


1 

1 
1 


floating point unit entry 
fp reverse 
fp duplicate 





Table 4.23 IMS T800 floating point rounding operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


22 


s 


fpurn 


1 


set rounding mode to round nearest 




06 


s 


fpurz 


1 


set rounding mode to round zero 




04 


s 


fpurp 


1 


set rounding mode to round positive 




05 


s 


fpurm 


1 


set rounding mode to round minus 





Table 4.24 IMS T800 floating point error operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


83 
90 
23 
90 


28F3 
29FC 

s 

s 


fpchkerror 
fptesterror 
fpuseterror 
fpuclearerror 


1 
2 

1 
1 


check fp error 

test fp error false and clear 

set fp error 

clear fp error 


E 
F 
F 
F 
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Table 4.25 IMS T800 floating point comparison operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


94 


29F4 


fpgt 


4/6 


fp greater than 


F 


95 


29F5 


fpeq 


3/5 


fp equality 


F 


92 


29F2 


fpordered 


3/4 


fp orderability 




91 


29F1 


fpnan 


2/3 


fpNaN 




93 


29F3 


fpnotfinite 


2/2 


fp not finite 




OE 


s 


fpuchki32 


3/4 


check in range of type int32 


F 


OF 


s 


fpuchki64 


3/4 


check in range of type int64 


F 



Processor cycles are shown as Typical/Maximum cycles. 



Table 4.26 IMS T800 floating point conversion operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


07 


s 


fpur32tor64 


3/4 


real32 to real64 


F 


08 


s 


fpur64tor32 


6/9 


real64 to real32 


F 


9D 


29 FD 


fprtoi32 


7/9 


real to int32 


F 


96 


29F6 


fpi32tor32 


8/10 


int32 to real32 




98 


29F8 


fpi32tor64 


8/10 


int32 to real64 




9A 


29FA 


fpb32tor64 


8/8 


bit32 to real64 




OD 


s 


fpunoround 


2/2 


real64 to real32, no round 




A1 


2AF1 


fpint 


5/6 


round to floating integer 


F 



Processor cycles are shown as Typical/Maximunfi cycles. 



Table 4.27 IMS T800 floating point arithmetic operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor cycles 


Name 


D 

E 


Single 


Double 


87 


28F7 


fpadd 


6/9 


6/9 


fp add 


F 


89 


28F9 


fpsub 


6/9 


6/9 


fp subtract 


F 


8B 


28FB 


fpmul 


11/18 


18/27 


fp multiply 


F 


80 


28FC 


fpdiv 


16/28 


31/43 


fp divide 


F 


OB 


S 


fpuabs 


2/2 


2/2 


fp absolute 


F 


8F 


28FF 


fpremfirst 


36/46 


36/46 


fp remainder first step 


F 


90 


29F0 


fpremstep 


32/36 


32/36 


fp remainder iteration 




01 


s 


fpusqrtfirst 


27/29 


27/29 


fp square root first step 


F 


02 


s 


fpusqrtstep 


42/42 


42/42 


fp square root step 




03 


s 


fpusqrtlast 


8/9 


8/9 


fp square root end 




OA 


s 


fpuexpinc32 


6/9 


6/9 


multiply by 2^^ 


F 


09 


s 


fpuexpdec32 


6/9 


6/9 


divide by 2^^ 


F 


12 


s 


fpumulby2 


6/9 


6/9 


multiply by 2.0 


F 


11 


8 


fpudivby2 


6/9 


6/9 


divide by 2.0 


F 



Processor cycles are shown as Typical/Maximum cycles. 
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5 Floating point unit 

The 64 bit FPU provides single and double length arithmetic to floating point standard ANSI-IEEE 754-1985. 
It Is able to perform floating point arithmetic concurrently with the central processor unit (CPU), sustaining 
3.3 Mflops on a 30 MHz device. All data communication between memory and the FPU occurs under control 
of the CPU. 

The FPU consists of a microcoded computing engine with a three deep floating point evaluation stack for 
manipulation of floating point numbers. These stack registers are FA, FB and FC, each of which can hold 
either 32 bit or 64 bit data; an associated flag, set when a floating point value is loaded, indicates which. The 
stack behaves In a similar manner to the CPU stack (page 193). 

As with the CPU stack, the FPU stack is not saved when rescheduling (page 196) occurs. The FPU can 
be used in both low and high priority processes. When a high priority process interrupts a low priority one 
the FPU state is saved inside the FPU. The CPU will service the interrupt immediately on completing its 
current operation. The high priority process will not start, however, before the FPU has completed its current 
operation. 

Points in an instruction stream where data need to be transferred to or from the FPU are called synchronisation 
points. At a synchronisation point the first processing unit to become ready will wait until the other is ready. 
The data transfer will then occur and both processors will proceed concurrently again. In order to make 
full use of concurrency, floating point data source and destination addresses can be calculated by the CPU 
whilst the FPU is performing operations on a previous set of data. Device performance is thus optimised by 
minimising the CPU and FPU idle times. 

The FPU has been designed to operate on both single length (32 bit) and double length (64 bit) floating 
point numbers, and returns results which fully conform to the ANSI-IEEE 754-1985 floating point arithmetic 
standard. Denormalised numbers are fully supported in the hardware. All rounding modes defined by the 
standard are implemented, with the default being round to nearest. 

The basic addition, subtraction, multiplication and division operations are performed by single instructions. 
However, certain less frequently used floating point instructions are selected by a value In register A (when 
allocating registers, this should be taken into account). A load constant instruction Idc is used to load 
register A; the floatingpoint entry instruction fpentryXhen uses this value to select the floating point operation. 
This pair of instructions is termed a selector sequence. 

Names of operations which use fpentry begin with fpu. A typical usage, returning the absolute value of a 
floating point number, would be 

Idc fpuabs; fpentry; 

Since the indirection code for fpuabs is OB, It would be encoded as 



Table 5.1 fpentry coding 



Mnemonic 



Function 
code 



Memory 
code 



Idc 

fpentry 
is coded as 

pfix 
opr 



fpuabs 

(op. code #AB) 

#A 
#B 



#4 



#2 
#F 



#4B 

#2AFB 

#2A 
#FB 
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The remainder an6 square root instructions take considerably longer than other instructions to complete. In 
order to minimise the interrupt latency period of the transputer they are split up to form instruction sequences. 
As an example, the instruction sequence for a single length square root is 

fpusqrtfirst; fpusqrtstep; fpusqrtstep; fpusqrtlast; 

The FPU has its own error flag FP.Error. This reflects the state of evaluation within the FPU and is set in 
circumstances where invalid operations, division by zero or overflow exceptions to the ANSI-IEEE 754-1985 
standard would be flagged (page 202). FP^Errorls also set if an input to a floating point operation is infinite or 
is not a number (NaN). The FP.Error flag can be set, tested and cleared without affecting the main Error flag, 
but can also set Error when required (page 202). Depending on how a program is compiled, it is possible for 
both unchecked and fully checked floating point arithmetic to be performed. 

Further details on the operation of the FPU can be found in Transputer Instruction Set - A Conripiler Writers' 
Guide. 



Table 5.2 Typical floating point operation times for IMS T800 



Operation 


T800-20 


1800-30 


Single length 


Double length 


Single length 


Double length 


add 

subtract 
multiply 
divide 


350 ns 
350 ns 
550 ns 
850 ns 


350 ns 

350 ns 

1000 ns 

1600 ns 


233 ns 
233 ns 
367 ns 
567 ns 


233 ns 
233 ns 
667 ns 

1067 ns 



Timing is for operations where both operands are normalised fp numbers. 
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6 System services 

System services include all the necessary logic to initialise and sustain operation of the device. They also 
include error handling and analysis facilities. 



6.1 



Power 



Power is supplied to the device via the VCC and GND pins. Several of each are provided to minimise 
inductance within the package. All supply pins must be connected. The supply must be decoupled close to 
the chip by at least one 100 nF low inductance (e.g. ceramic) capacitor between VCC and GND. Four layer 
boards are recommended; if two layer boards are used, extra care should be taken in decoupling. 

Input voltages must not exceed specification with respect to VCC and GND, even during power-up and power- 
down ramping, otherwise latchup can occur. CMOS devices can be permanently damaged by excessive 
periods of latchup. 



6.2 



CapPlus, CapMinus 



The internally derived power supply for internal clocks requires an external low leakage, low inductance VF 
capacitor to be connected between CapPius and CapMinus. A ceramic capacitor is preferred, with an 
impedance less than 3 Ohms between 100 KHz and 10 MHz. If a polarised capacitor is used the negative 
terminal should be connected to CapMinus. Total PCB track length should be less than 50 mm. The 
connections must not touch power supplies or other noise sources. 









P.C.B. track 




VCC -T" 

T 


CapPlus 




1 






Decoupling 
=4= capacitor 
1/iF 




Phase-locked 
loops 






1 






1. 
GND -^ 


CapMinus 


P.C.B. track 









Figure 6.1 Recommended PLL decoupling 



6.3 



Clockln 



Transputer family components use a standard clock frequency, supplied by the user on the Clockln input. 
The nominal frequency of this clock for all transputer family components is 5 MHz, regardless of device type, 
transputer word length or processor cycle time. High frequency internal clocks are derived from Clockln, 
simplifying system design and avoiding problems of distributing high speed clocks externally. 

A number of transputer devices may be connected to a common clock, or may have individual clocks providing 
each one meets the specified stability criteria. In a multi-clock system the relative phasing of Clockln clocks 
Is not important, due to the asynchronous nature of the links. Mark/space ratio is unimportant provided the 
specified limits of Clockln pulse widths are met. 

Oscillator stability is important. Clockln must be derived from a crystal oscillator; RC oscillators are not 
sufficiently stable. Clockln must not be distributed through a long chain of buffers. Clock edges must be 
monotonic and remain within the specified voltage and time limits. 
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Table 6.1 Input clock 



SYIVIBOL 


PARAIVIETER 


IVIIN 


NOM 


MAX 


UNITS 


NOTE 


TDCLDCH 


Clockln pulse width low 


40 






ns 


1 


TDCHDCL 


Clockln pulse width high 


40 






ns 


1 


TDCLDCL 


Clockln period 




200 




ns 


1,2,4 


TDCerror 


Clockln timing error 






±0.5 


ns 


1.3 


TDC1DC2 


Difference in Clockln for 2 linked devices 






400 


ppm 


1,4 


TDCr 


Clockln rise time 






10 


ns 


1,5 


TDCf 


Clockln fall time 






8 


ns 


1,5 



Notes 



1 These paramters are not tested. 

2 Measured between corresponding points on consecutive falling edges. 

3 Variation of individual falling edges from their nominal times. 

4 This value allows the use of 200ppm crystal oscillators for two devices connected together by a link. 

5 Clock transitions must be monotonic within the range VIH to VIL (table 1 1.3). 



TDCerror 



, 5, 2.0V- - --^K^ 



i TDCerror 



TDCerror ^ 




TDCLDCH TDCHDCL 




TDCLDCL 

90% 

10% 



^ TDCerror 




Figure 6.2 Clockln timing 



6.4 



ProcSpeedSelectO-2 



Processor speed of the IMS T800 is variable in discrete steps. The desired speed can be selected, up to the 
maximum rated for a particular component, by the three speed select lines ProcSpeedSelectO-2. The pins 
are tied high or low, according to the table below, for the various speeds. The pins are arranged so that the 
IMS T425 can be plugged directly into a board designed for a IMS T800. 

Only six of the possible speed select combinations are currently used; the other two are not valid speed 
selectors. The frequency of Clockln for the speeds given in the table is 5 MHz. 



6 System services 



213 



Table 6.2 Processor speed selection 



Proc 
Speed 
Select2 


Proc 
Speed 
Selecti 


Proc 
Speed 
SelectO 


Processor 

Clock 
Speed MHz 


Processor 

Cycle 
Time ns 


Notes 











20.0 


50.0 










1 


22.5 


44.4 







1 





25.0 


40.0 







1 


1 


30.0 


33.3 




1 








35.0 


28.6 




1 





1 






Invalid 


1 


1 





17.5 


57.1 




1 


1 


1 






Invalid 



Note: Inclusion of a speed selection in this table does not imply immediate availability. 



6.5 



Reset 



Reset can go high with VCC, but must at no time exceed the maximum specified voltage for VIH. After VCC is 
valid Clockln should be running for a minimum period TDCVRL before the end of Reset. The falling edge of 
Reset initialises the transputer, triggers the memory configuration sequence and starts the bootstrap routine. 
Link outputs are forced low during reset; link inputs and EventReq should be held low. Memory request 
(DMA) must not occur whilst Reset is high but can occur before bootstrap (page 238). 

After the end of Reset there will be a delay of 144 periods of Clockln (figure 6.3). Following this, the 
MemWrDO, MemRfDI and MemAD2-31 pins will be scanned to check for the existence of a pre-programmed 
memory interface configuration (page 228). This lasts for a further 144 periods of Clockln. Regardless of 
whether a configuration was found, 36 configuration read cycles will then be performed on external memory 
using the default memory configuration (page 230), in an attempt to access the external configuration ROM. 
A delay will then occur, its period depending on the actual configuration. Finally eight complete and con- 
secutive refresh cycles will initialise any dynamic RAM, using the new memory configuration. If the memory 
configuration does not enable refresh of dynamic RAM the refresh cycles will be replaced by an equivalent 
delay with no external memory activity. 

If BootFromRom is high bootstrapping will then take place immediately, using data from external memory; 
otherwise the transputer will await an input from any link. The processor will be in the low priority state. 



Reset L 

Action h 



Delay 



Internal 



External 



configuration configuration 



Delay 



Refresh 



Boot 



Figure 6.3 IMS T800 post-reset sequence 



6.6 



Bootstrap 



The transputer can be bootstrapped either from a link or from external ROM. To facilitate debugging, Boot- 
FromRom may be dynamically changed but must obey the specified timing restrictions. It is sampled once 
only by the transputer, before the first instruction is executed after Reset is taken low. 

If BootFromRom Is connected high (e.g. to VCC) the transputer starts to execute code from the top two bytes 
in external memory, at address #7FFFFFFE. This location should contain a backward jump to a program in 
ROM. Following this access, BootFromRom may be taken low if required. The processor is in the low priority 
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state, and the IV register points to MemStart (page 217). 



Table 6.3 Reset and Analyse 



SYIVIBOL 


PARAMETER 


MiN 


NOM 


MAX 


UNITS 


NOTE 


TPVRH 


Power valid before Reset 


10 






ms 




TRHRL 


Reset pulse width high 


8 






Clockln 


1 


TDCVRL 


Clock! n running before Reset end 


10 






ms 


2 


TAHRH 


Analyse setup before Reset 


3 






ms 




TRLAL 


Analyse hold after Reset end 


1 






Clockln 


1 


TBRVRL 


BootFromRom setup 









ms 




TRLBRX 


BootFromRom hold after Reset 









ms 


3 


TALBRX 


BootFromRom hold after Analyse 










3 



Notes 



1 Full periods of Clockln TDCLDCL required. 

2 At power-on reset. 

3 Mu§t be stable until after end of bootstrap period. See Bootstrap section. 
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Figure 6.4 Transputer reset timing with Analyse low 
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Figure 6.5 Transputer reset and analyse timing 
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If BootFromRom is connected low (e.g. to GND) the transputer will wait for the first bootstrap message to 
arrive on any one of its links. The transputer is ready to receive the first byte on a link within two processor 
cycles TPCLPCL after Reset goes low. 

If the first byte received (the control byte) is greater than 1 it is taken as the quantity of bytes to be input. The 
following bytes, to that quantity, are then placed in internal memory starting at location MemStart. Following 
reception of the last byte the transputer will start executing code at MemStart as a low priority process. 
BootFromRom may be taken high after reception of the last byte, if required. The memory space immediately 
above the loaded code Is used as work space. Messages arriving on other links after the control byte has 
been received and on the bootstrapping link after the last bootstrap byte will be retained until a process inputs 
from them. 

6.7 Peek and poke 

Any location in internal or external memory can be interrogated and altered when the transputer is waiting 
for a bootstrap from link. If the control byte is then eight more bytes are expected on the same link. The 
first four byte word is taken as an internal or external memory address at which to poke (write) the second 
four byte word. If the control byte is 1 the next four bytes are used as the address from which to peek (read) 
a word of data; the word is sent down the output channel of the same link. 

Following such a peek or poke, the transputer returns to its previously held state. Any number of accesses 
may be made in this way until the control byte is greater than 1 , when the transputer will commence reading 
its bootstrap program. Any link can be used, but addresses and data must be transmitted via the same link 
as the control byte. 

6.8 Analyse 

If Analyse Is taken high when the transputer is running, the transputer will halt at the next descheduling point 
(page 201). From Analyse being asserted, the processor will halt within three time slice periods plus the 
time taken for any high priority process to complete. As much of the transputer status is maintained as is 
necessary to permit analysis of the halted machine. Processor flags Error and HaltOnError are not altered 
at reset, whether Analyse is asserted or not. Memory refresh continues. 

Input links will continue with outstanding transfers. Output links will not make another access to memory 
for data but will transmit only those bytes already In the link buffer. Providing there is no delay in link 
acknowledgement, the links should be inactive within a few microseconds of the transputer halting. 

Reset should not be asserted before the transputer has halted and link transfers have ceased. When Reset 
is taken low whilst Analyse is high, neither the memory configuration sequence nor the block of eight refresh 
cycles will occur; the previous memory configuration will be used for any external memory accesses. If 
BootFromRom is high the transputer will bootstrap as soon as Analyse is taken low, othenA/ise it will await a 
control byte on any link. If Analyse is taken low without Reset going high the transputer state and operation 
are undefined. After the end of a valid Analyse sequence the registers have the values given In table 6.4. 

Table 6.4 Register values after Analyse 



/ MemStart if bootstrapping from a link, or the external memory bootstrap address if 
bootstrapping from ROM. 

W MemStart if bootstrapping from ROM, or the address of the first free word after the 
bootstrap program if bootstrapping from link. 

A The value of / when the processor halted. 

B The value of IV when the processor halted, together with the priority of the process 
when the transputer was halted (I.e. the IV descriptor). 

C The ID of the bootstrapping link if bootstrapping from link. 
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6.9 



Error, Errorin 



The Error pin carries the OR'ed output of the internal Error flag and the Errorin input. If Error is high 
it indicates either that Errorin is high or that an error was detected in one of the processes. An internal 
error can be caused, for example, by arithmetic overflow, divide by zero, array bounds violation or software 
setting the flag directly (page 202). It can also be set from the floating point unit under certain circumstances 
(page 202, 209). Once set, the Error flag is only cleared by executing the instruction testerr. The error is not 
cleared by processor reset, in order that analysis can Identify any errant transputer (page 215). 

A process can be programmed to stop if the Error flag is set; it cannot then transmit erroneous data to other 
processes, but processes which do not require that data can still be scheduled. Eventually all processes 
which rely, directly or indirectly, on data from the process in error will stop through lack of data. Errorin does 
not directly affect the status of a processor in any way. 

By setting the HaltOn Error i\aQ the transputer itself can be programmed to halt if Error becomes set. If Error 
becomes set after HaltOnErrorhas been set, all processes on that transputer will cease but will not necessarily 
cause other transputers in a network to halt. Setting HaltOn Error a^ter Error will not cause the transputer to 
halt; this allows the processor reset and analyse facilities to function with the flags in indeterminate states. 

An alternative method of error handling is to have the errant process or transputer cause all transputers 
to halt. This can be done by 'daisy-chaining' the Errorin and Error pins of a number of processors and 
applying the final Error output signal to the EventReq pin of a suitably programmed master transputer. Since 
the process state is preserved when stopped by an error, the master transputer can then use the analyse 
function to debug the fault. When using such a circuit, note that the Error flag is in an indeterminate state on 
power up; the circuit and software should be designed with this in mind. 

Error checks can be removed completely to optimise the performance of a proven program; any unexpected 
error then occurring will have an arbitrary undefined effect. 

If a high priority process pre-empts a low priority one, status of the Error and Ha/fOnError flags is saved for 
the duration of the high priority process and restored at the conclusion of it. Status of both flags is transmitted 
to the high priority process. Either flag can be altered in the process without upsetting the error status of any 
complex operation being carried out by the pre-empted low priority process. 

In the event of a transputer halting because of HaltOnError, the links will finish outstanding transfers before 
shutting down. If Analyse is asserted then all inputs continue but outputs will not make another access to 
memory for data. Memory refresh will continue to take place. 

After halting due to the Error flag changing from to 1 whilst HaltOnError Is set, register /points two bytes 
past the instruction which set Error. After halting due to the Analyse pin being taken high, register / points 
one byte past the instruction being executed. In both cases /will be copied to register A. 
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Figure 6.6 Error handling in a multi-transputer system 
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7 Memory 

The IMS T800 has 4 Kbytes of fast internal static memory for high rates of data throughput. Each internal 
memory access takes one processor cycle ProcClockOut (page 219). The transputer can also access 
4 Gbytes of external memory space. Internal and external memory are part of the same linear address 
space. Internal RAM can be disabled by holding DisablelntRAM high. All internal addresses are then 
mapped to external RAM. This pin should not be altered after Reset has been taken low. 

IMS T800 memory is byte addressed, with words aligned on four-byte boundaries. The least significant byte 
of a word is the lowest addressed byte. 

The bits In a byte are numbered to 7, with bit the least significant. The bytes are numbered from 0, with 
byte the least significant. In general, wherever a value is treated as a number of component values, the 
components are numbered in order of increasing numerical significance, with the least significant component 
numbered 0. Where values are stored in memory, the least significant component value is stored at the 
lowest (most negative) address. 

Internal memory starts at the most negative address #80000000 and extends to #80000FFF. User memory 
begins at #80000070; this location is given the name MemStart. 

The reserved area of internal memory below MemStart is used to implement link and event channels. 

Two words of memory are reserved for timer use, TPtrLocO for high priority processes and TPtrLod for low 
priority processes. They either indicate the relevant priority timer is not in use or point to the first process on 
the timer queue at that priority level. 

Values of certain processor registers for the current low priority process are saved in the reserved IntSaveLoc 
locations when a high priority process pre-empts a low priority one. Other locations are reserved for extended 
features such as block moves and floating point operations. 

External memory space starts at #80001000 and extends up through #00000000 to #7FFFFFFF. Memory 
configuration data and ROM bootstrapping code must be in the most positive address space, starting at 
#7FFFFF6C and #7FFFFFFE respectively. Address space immediately below this is conventionally used for 
ROM based code. 



218 



6 [MS T800 engineering data 





hi Machine map lo 


Byte address 


Word offsets 


Occam map 






Reset Inst 


#7FFFFFFE 

#7FFFFFF8 

#7FFFFF60 

#0 

#80001 000 - Start of external memory - #0400 

#80000070 MemStart MemStart #10 ' 






Memory configuration 








Reserved for 
Extended functions 


#80000060 ^ 

#80000048 

#80000044 

#80000040 

#80000030 

#80000038 

#80000034 

#80000030 

#80000020 

#80000028 

#80000024 

#80000020 

#80000010 

#80000018 

#80000014 

#80000010 

#80000000 

#80000008 

#80000004 

#80000000 y 


> Note 1 

#08 
#07 
#06 
#05 
#04 
#03 
#02 
#01 
(Base of memory) #00 




EreglntSaveLoc 


STATUSIntSaveLoc 


OreglntSaveLoc 


BreglntSaveLoc 


AreglntSaveLoc 


IptrlntSaveLoc 


WdesclntSaveLoc 


TPtrLocI 


TPtrLocO 


Event 


Event 


Link 3 Input 


Link 3 Input 


Link 2 Input 


Link 2 Input 


Link 1 Input 


Link 1 Input 


Link Input 


Link Input 


Link 3 Output 


Link 3 Output 


Link 2 Output 


Link 2 Output 


Link 1 Output 


Link 1 Output 


Link Output 


Link Output 















Notes 



Figure 7.1 IMS T800 memory map 



1 These locations are used as auxiliary processor registers and should not be manipulated by the user. Like 
processor registers, their contents may be useful for implementing debugging tools (Analyse, page 215). For 
details see Transputer Instruction Set - A Compiler Writers' Guide. 
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8 External memory interface 

The External Memory Interface (EMI) allows access to a 32 bit address space, supporting dynamic and static 
RAM as well as ROM and EPROM. EMI timing can be configured at Reset to cater for most memory types 
and speeds, and a program is supplied with the Transputer Development System to aid in this configuration. 

There are 17 Internal configurations which can be selected by a single pin connection (page 228). If none 
are suitable the user can configure the Interface to specific requirements, as shown In page 230. 

8.1 ProcClockOut 

This clock Is derived from the internal processor clock, which Is in turn derived from Clockln. Its period is 
equal to one Internal microcode cycle time, and can be derived from the formula 

TPCLPCL = TDCLDCL / PLLx 

where TPCLPCL is the ProcClockOut Period, TDCLDCL Is the Clockln Period and PLLx Is the phase 
lock loop factor for the relevant speed part, obtained from the ordering details (Ordering section). 

The time value Tm is used to define the duration of Tstates and, hence, the length of external memory cycles; 
its value is exactly half the period of one ProcClockOut cycle (0.5*TPCLPCL), regardless of mark/space 
ratio of ProcClockOut. 

Edges of the various external memory strobes coincide with rising or falling edges of ProcClockOut. It should 
be noted, however, that there is a skew associated with each coincidence. The value of skew depends on 
whether coincidence occurs when the ProcClockOut edge and strobe edge are both rising, when both are 
falling or if either Is rising when the other is falling. Timing values given in the strobe tables show the best 
and worst cases. If a more accurate timing relationship is required, the exact Tstate timing and strobe edge 
to ProcClockOut relationships should be calculated and the correct skew factors applied from the edge skew 
timing table 8.4. 

The timing parameters in the following tables are based on full characterisation of the 17 MHz and 20 MHz 
parts. Data for higher speeds Is based on tests on a limited number of samples and may change when full 
characterisation is completed. 

8.2 Tstates 

The external memory cycle Is divided into six Tstates with the following functions: 

T1 Address setup time before address valid strobe. 

T2 Address hold time after address valid strobe. 

T3 Read cycle tristate or write cycle data setup. 

T4 Extendable data setup time. 

T5 Read or write data. 

T6 Data hold. 

Under normal conditions each Tstate may be from one to four periods Tm long, the duration being set during 
memory configuration. The default condition on Reset Is that all Tstates are the maximum four periods Tm 
long to allow external initialisation cycles to read slow ROM. 

Period T4 can be extended indefinitely by adding externally generated wait states. 

An external memory cycle is always an even number of periods Tm in length and the start of T1 always 
coincides with a rising edge of ProcClockOut. If the total configured quantity of periods Tm Is an odd 
number, one extra period Tm will be added at the end of T6 to force the start of the next T1 to coincide with 
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a rising edge of ProcClockOut. This period is designated E in configuration diagrams (figure 8.11). 

Table 8.1 ProcClockOut 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPCLPCL 


ProcClockOut period 


a-2 


a 


a+2 


ns 


1,5 


TPCHPCL 


ProcClockOut pulse width high 


b-1 1 .5 


b 


b+3.5 


ns 


2.5 


TPCLPCH 


ProcClockOut pulse width low 




c 




ns 


3,5 


Tm 


ProcClockOut half cycle 


b-1 


b 


b+1 


ns 


2,5 


TPCstab 


ProcClockOut stability 






8 


% 


4,5 



Notes 



1 a is TDCLDCL/PLLx. 

2 b is 0.5*TPCLPCL (half the processor clock period). 

3 c is TPCLPCL-TPCHPCL. 

4 Stability is the variation of cycle periods between two consecutive cycles, measured at corresponding points on 
the cycles. 

5 This parameter is sampled and not 100% tested. 
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Figure 8.1 IMS T800 ProcClockOut timing 



8.3 



Internal access 



During an internal memory access cycle the external memory interface bus MemAD2-31 reflects the word 
address used to access internal RAM, MemnotWrDO reflects the read/write operation and MemnotRfDI is 
high; all control strobes are inactive. This is true unless and until a memory refresh cycle or DMA (memory 
request) activity takes place, when the bus will carry the appropriate external address or data. 

The bus activity is not adequate to trace the internal operation of the transputer in full, but may be used for 
hardware debugging in conjuction with peek and poke (page 215). 
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Figure 8.2 IMS T800 bus activity for internal memory cycle 
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8.4 MemAD2-31 

External memory addresses and data are multiplexed on one bus. Only the top 30 bits of address are 
output on the external memory interface, using pins MemAD2-31. They are normally output only during 
Tstates T1 and T2, and should be latched during this time. Byte addressing is carried out internally by the 
transputer for read cycles. For write cycles the relevant bytes in memory are addressed by the write strobes 
notMemWrBO-3. 

The data bus is 32 bits wide. It uses MemAD2-31 for the top 30 bits and MemnotRfDI and MemnotWrDO 
for the lower two bits. Read cycle data may be set up on the bus at any time after the start of T3, but must 
be valid when the transputer reads it at the end of T5. Data may be removed any time during T6, but must 
be off the bus no later than the end of that period. 

Write data is placed on the bus at the start of T3 and removed at the end of 16. If T6 is extended to force 
the next cycle Tmx (page 221) to start on a rising edge of ProcClockOut, data will be valid during this time 
also. 

8.5 MemnotWrDO 

During 11 and T2 this pin will be low if the cycle is a write cycle, otherwise it will be high. During Tstates T3 
to T6 it becomes bit of the data bus. In both cases it follows the general timing of MemAD2-31. 

8.6 MemnotRfDI 

During T1 and T2, this pin is low if the address on MemAD2-31 is a refresh address, otherwise it is high. 
During Tstates T3 to T6 it becomes bit 1 of the data bus. In both cases it follows the general timing of 
MemAD2-31. 

8.7 notMemPd 

For a read cycle the read strobe notMemRd is low during T4 and T5. Data is read by the transputer on the 
rising edge of this strobe, and may be removed Immediately afterward. If the strobe duration is insufficient it 
may be extended by adding extra periods Tm to either or both of the Tstates T4 and T5. Further extension 
may be obtained by inserting wait states at the end of T4. 

In the read cycle timing diagrams ProcClockOut is included as a guide only; it is shown with each Tstate 
configured to one period Tm. 

8.8 notMemSO-4 

To facilitate control of different types of memory and devices, the EMI is provided with five strobe outputs, 
four of which can be configured by the user. The strobes are conventionally assigned the functions shown in 
the read and write cycle diagrams, although there is no compulsion to retain these designations. 

notMemSO is a fixed format strobe. Its leading edge is always coincident with the start of T2 and its trailing 
edge always coincident with the end of T5. 

The leading edge of notMemSI is always coincident with the start of T2, but its duration may be configured 
to be from zero to 31 periods Tm. Regardless of the configured duration, the strobe will terminate no later 
than the end of T6. The strobe is sometimes programmed to extend beyond the normal end of Tmx. When 
wait states are inserted into an EMI cycle the end of Tmx is delayed, but the potential active duration of the 
strobe is not altered. Thus the strobe can be configured to terminate relatively early under certain conditions 
(page 236). If notMemSI is configured to be zero it will never go low. 



222 



6 IMS T800 engineering data 



notMemS2, notMemSS and notMemS4 are identical in operation. They ail terminate at the end of T5, but 
the start of each can be delayed from one to 31 periods Tm beyond the start of T2. If the duration of one of 
these strobes would take it past the end of T5 it will stay high. This can be used to cause a strobe to become 
active only when wait states are inserted. If one of these strobes is configured to zero it will never go low. 
Figure 8.5 shows the effect of Wait on strobes in more detail; each division on the scale is one period Tm. 

Table 8.2 Read 



SYiVIBOL 


PARAIVIETER 


IVIiN 


NOIVI 


MAX 


UNITS 


NOTE 


TaZdV 


Address tristate to data valid 









ns 




TdVRdH 


Data setup before read 


25 






ns 




TRdHdX 


Data hold after read 









ns 




TSOLRdL 


notMemSO before start of read 


a-4 


a 


a+4 


ns 


1 


TSOHRdH 


End of read from end of notMemSO 


-4 




4 


ns 




TRdLRdH 


Read period 


b-3 




b+5 


ns 


2 



Notes 



1 a is total of T2+T3 where T2, T3 can be from one to four periods Tm each in length. 

2 b is total of T4+Twalt+T5 where T4, T5 can be from one to four periods Tm each in length and Twalt may be 
any number of periods Tm in length. 
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Figure 8.3 IMS T800 external read cycle: static memory 
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Figure 8.4 IMS T800 external read cycle: dynamic memory 
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Table 8.3 IMS T800 strobe timing 



SYMBOL 


(n) 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TaVSOL 




Address setup before notMemSO 


a-8 






ns 


1 


TSOLaX 




Address hold after notMemSO 


b-8 


b 


b+8 


ns 


2 


TSOLSOH 




notMemSO pulse width low 


c-5 




c-i-6 


ns 


3 


TS0LS1L 


1 


notMemSI from notMemSO 


-4 




4 


ns 




TS0LS1H 


5 


notMemSl end from notMemSO 


d-1 




6+9 


ns 


4,6 


TS0HS1H 


9 


notMemSI end from notMemSO end 


e-8 




e+4 


ns 


5.6 


TS0LS2L 


2 


notMemS2 delayed after notMemSO 


f-6 




f+5 


ns 


7 


TS0LS2H 


6 


notMemS2 end from notMemSO 


c-5 




c+7 


ns 


3 


TS0HS2H 


10 


notMemS2 end from notMemSO end 


-4 




7 


ns 




TS0LS3L 


3 


notMemS3 delayed after notMemSO 


f-6 




f+5 


ns 


7 


TS0LS3H 


7 


notMemS3 end from notMemSO 


c-5 




c+7 


ns 


3 


TS0HS3H 


11 


notMemS3 end from notMemSO end 


-4 




7 


ns 




TS0LS4L 


4 


notMemS4 delayed after notMemSO 


f-6 




f+5 


ns 


7 


TS0LS4H 


8 


notMemS4 end from notMemSO 


C-5 




c+7 


ns 


3 


TS0HS4H 


12 


notMemS4 end from notMemSO end 


-4 




7 


ns 




Tmx 




Complete external memory cycle 




g 






8 



Notes 



1 a is T1 where T1 can be from one to four periods Tm in length. 

2 b is T2 where T2 can be from one to four periods Tm in length. 

3 c is total of T2+T3+T4+Twait+T5 where T2, T3, T4, T5 can be from one to four periods Tm each in length and 
Twait may be any number of periods Tm in length. 

4 d can be from zero to 31 periods Tm in length. 

5 e can be from -27 to +4 periods Tm in length. 

6 If the configuration would cause the strobe to remain active past the end of T6 it will go high at the end of T6. 
If the strobe is configured to zero periods Tm it will remain high throughout the complete cycle Tmx. 

7 f can be from zero to 31 periods Tm in length. If this length would cause the strobe to remain active past the 
end of T5 it will go high at the end of T5. If the strobe value is zero periods Tm it will remain low throughout 
the complete cycle Tmx. 

8 g is one complete external memory cycle comprising the total of T1+T2+T3+T4+Twait+T5+T6 where T1, T2, 
T3, T4, T5 can be from one to four periods Tm each in length, T6 can be from one to five periods Tm in length 
and Twait may be zero or any number of periods Tm in length. 
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Figure 8.5 IMS T800 effect of wait states on strobes 
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Table 8.4 Strobe SO to ProcClockOut skew 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPCHSOH 
TPCLSOH 
TPCHSOL 
TPCLSOL 


notMemSO rising from ProcClockOut rising 
notMenfiSO rising from ProcClockOut falling 
notMemSO falling from ProcClockOut rising 
notMemSO falling from ProcClockOut falling 


-6 
-5 
-8 
-5 




4 
10 
3 
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ns 
ns 
ns 
ns 
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Figure 8.6 IMS T800 skew of notMemSO to ProcClockOut 
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notMemWrBO-3 



Because the transputer uses word addressing, four write strobes are provided; one to write each byte of the 
word. If a particular byte is not to be written, then the corresponding data outputs are tristated. notMemWrBO 
addresses the least significant byte. 

The transputer has both early and late write cycle modes. For a late write cycle the relevant write strobes 
notMemWrBO-3 are low during T4 and T5; for an early write they are also low during T3. Data should be 
latched into memory on the rising edge of the strobes in both cases, although it is valid until the end of T6. 
If the strobe duration is insufficient, it may be extended at configuration time by adding extra periods Tm to 
either or both of Tstates T4 and T5 for both early and late modes. For an early cycle they may also be added 
to T3. Further extension may be obtained by inserting wait states at the end of T4. If the data hold time is 
insufficient, extra periods Tm may be added to T6 to extend it. 



Table 8.5 Write 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TdVWrH 


Data setup before write 


d-7 




d+10 


ns 


1,5 


TWrHdX 


Data hold after write 


a-10 




a+5 


ns 


1,2 


TSOLWrL 


notMemSO before start of early write 


b-5 




b+5 


ns 


1,3 




notMemSO before start of late write 


c-5 




0+5 


ns 


1,4 


TSOHWrH 


End of write from end of notMemSO 


-5 




4 


ns 


1 


TWrLWrH 


Early write pulse width 


d-4 




d+7 


ns 


1,5 




Late write pulse width 


e-4 




e+7 


ns 


1.6 



Notes 



1 Timing is for ail write strobes notMemWrBO-3. 

2 a is T6 where T6 can be from one to five periods Tm in length. 

3 b is T2 where T2 can be from one to four periods Tm in length. 

4 c is total of T2+T3 where T2, T3 can be from one to four periods Tm each in length. 

5 d is total of T3+T4+Twalt+T5 where T3, T4, T5 can be from one to four periods Tm each in length and Twait 
may be zero or any number of periods Tm in length. 

6 e is total of T4+Twalt+T5 where T4, T5 can be from one to four periods Tm each in length and Twait may be 
zero or any number of periods Tm in length. 
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Figure 8.7 IMS T800 external write cycle 

In the write cycle timing diagrann ProcClockOut is Included as a guide only; it is shown with each Tstate 
configured to one period Tm. The strobe is inactive during Internal nnemory cycles. 
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Figure 8.8 IMS T800 dynamic RAM application 
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8.10 MemConfig 

l\/lemConfig is an input pin used to read configuration data when setting external memory interface (EMI) 
characteristics. It is read by the processor on two occasions after Reset goes low; first to check if one of the 
preset internal configurations is required, then to determine a possible external configuration. 

8.10.1 Internal configuration 

The internal configuration scan comprises 64 periods TDCLDCL of Clockln during the internal scan period 
of 144 Clockln periods. MemnotWrDO, MemnotRfDI and MemAD2-32 are all high at the beginning of the 
scan. Starting with MemnotWrDO, each of these lines goes low successively at intervals of two Clockln 
periods and stays low until the end of the scan. If one of these lines is connected to MemConfig the preset 
internal configuration mode associated with that line will be used as the EMI configuration. The default 
configuration is that defined in the table for MemAD31; connecting MemConfig to VCC will also produce 
this default configuration. Note that only 17 of the possible configurations are valid, all others remain at the 
default configuration. 

Table 8.6 IMS T800 internal configuration coding 





Duration of each Tstate 




Strobe 




Write 


Refresh 


Cycle 








periods Tm 






{ 


coefficient 




cycle 


interval 


time 




















Clockln 


Proc 


Pin 


71 


T2 


73 74 


75 


76 


Si 


s2 S3 


S4 


type 


cycles 


cycles 


MemnotWrDO 


1 


1 


1 1 


1 


1 


30 


1 3 


5 


late 


72 


3 


MemnotRfDI 


1 


2 


1 1 


1 


2 


30 


1 2 


7 


late 


72 


4 


MemAD2 


1 


2 


1 1 


2 


3 


30 


1 2 


7 


late 


72 


5 


MemADS 


2 


3 


1 1 


2 


3 


30 


1 3 


8 


late 


72 


6 


MemAD4 


1 


1 


1 1 


1 


1 


3 


1 2 


3 


early 


72 


3 


MemADS 


1 


1 


2 1 


2 


1 


5 


1 2 


3 


early 


72 


4 


MemADG 


2 


1 


2 1 


3 


1 


6 


1 2 


3 


early 


72 


5 


MemAD7 


2 


2 


2 1 


3 


2 


7 


1 3 


4 


early 


72 


6 


MemADS 


1 


1 


1 1 


1 


1 


30 


1 2 


3 


early 


t 


3 


MemADS 


1 


1 


2 1 


2 


1 


30 


2 5 


9 


early 


t 


4 


MemADIO 


2 


2 


2 2 


4 


2 


30 


2 3 


8 


late 


72 


7 


MemADII 


3 


3 


3 3 


3 


3 


30 


2 4 


13 


late 


72 


9 


MemAD12 


1 


1 


2 1 


2 


1 


4 


1 2 


3 


early 


72 


4 


MemADia 


2 


1 


2 1 


2 


2 


5 


1 2 


3 


early 


72 


5 


MemAD14 


2 


2 


2 1 


3 


2 


6 


1 3 


4 


early 


72 


6 


MemADIS 


2 


1 


2 3 


3 


3 


8 


1 2 


3 


early 


72 


7 


MemAD31 


4 


4 


4 4 


4 


4 


31 


30 30 


18 


late 


72 


12 



t Provided for static RAM only. 
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Figure 8.9 IMS T800 internal configuration 
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Delay 



Internal configuration 



External configuration 




MemnotWrDO 
MemnotRfDI 
MemAD2 
MemAD3 

MemAD31 
l\/lemConfig © 
l\/lemConfig ® 



CO Internal configuration: IVIemConfig connected to IVIemAD2 

© External configuration: IVlemCcnfig connected to inverse of IVIemADS 



Figure 8.10 IMS T800 Internal configuration scan 

8.10.2 External configuration 

If MemConfig is held low until MemnotWrDO goes low the internal configuration is ignored and an external 
configuration will be loaded instead. An external configuration scan always follows an internal one, but if an 
internal configuration occurs any external configuration is ignored. 

The external configuration scan comprises 36 successive external read cycles, using the default EMI con- 
figuration preset by MemADSI . However, instead of data being read on the data bus as for a normal read 
cycle, only a single bit of data is read on MemConfig at each cycle. Addresses put out on the bus for each 
read cycle are shown in table 8.7, and are designed to address ROM at the top of the memory map. The 
table shows the data to be held In ROM; data required at the MemConfig pin is the inverse of this. 

MemConfig is typically connected via an inverter to MemnotWrDO. Data bit zero of the least significant byte 
of each ROM word then provides the configuration data stream. By switching MemConfig between various 
data bus lines up to 32 configurations can be stored in ROM, one per bit of the data bus. MemConfig can be 
permanently connected to a data line or to GND. Connecting MemConfig to GND gives all Tstates configured 
to four periods; notMemSI pulse of maximum duration; notMemS2-4 delayed by maximum; refresh interval 
72 periods of Clockln; refresh enabled; late write. 

The external memory configuration table 8.7 shows the contribution of each memory address to the 13 con- 
figuration fields. The lowest 12 words (#7FFFFF6C to #7FFFFF98, fields 1 to 6) define the number of extra 
periods Tm to be added to each Tstate. If field 2 is 3 then three extra periods will be added to T2 to extend 
It to the maximum of four periods. 

The next five addresses (field 7) define the duration of notMemSI and the following fifteen (fields 8 to 1 0) 
define the delays before strobes notMemS2-4 become active. The five bits allocated to each strobe allow 
durations of from to 31 periods Tm, as described in strobes page 221 . 

Addresses #7FFFFFEC to #7FFFFFF4 (fields 1 1 and 12) define the refresh interval and whether refresh is to 
be used, whilst the final address (field 13) supplies a high bit to MemConfig if a late write cycle is required. 



The columns to the right of the coding table show the values of each configuration bit for the four sample 
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external configuration diagrams. Note the Inclusion of period E at the end of T6 In sonne diagrams. This Is 
inserted to bring the start of the next Tstate T1 to coincide with a rising edge of ProcClockOut (page 219). 

Wait states W have been added to show the effect of them on strobe timing; they are not part of a configuration. 
In each case which Includes wait states, two wait periods are defined. This shows that If a wait state would 
cause the start of T5 to coincide with a falling edge of ProcClockOut, another period Tm Is generated by 
the EMI to force it to coincide with a rising edge of ProcClockOut. This coincidence is only necessary if wait 
states are added, otherwise coincidence with a falling edge Is permitted. Any configuration memory access 
Is only permitted to be extended using wait, up to a total of 14 Clockln periods. 
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notMemRd L 


r 










notMemWr late L 


r 










MemWait O 1 


"~i 










MemWait &) 1 


1 





Example 3 
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© One wait state inserted 
© Two wait states Inserted 
@ Three wait states inserted 
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Figure 8.1 1 IMS T800 external configuration 



232 



6 IMS T800 engineering data 



Internal configuration 



Address 



External configuration 



o 

1^ 



LL 
U- 
U. 



ft 






u. 

LL 



O 

LL 



// 



MemnotWrDO f 

MemnotRfDI _J 

MemAD2 f 

MemADS 

MemADSI 



J 



^ 
^ 



^ 






MemConfig (T) [ 
notMemRd _\ |_J 

h 



i_jn 



^ 
^ 



^ 



^ 



JilLf 



^ 
^ 



^ 



^ 



^ 
^ 






"U-TJ 



^ 



^ 



// 



(D 



® 



lJ-7, 



O 

LU 



1^ 



Li. 
LL. 



LL 
LL 



Delay 



^ 



^ 



^ 
^ 



^ 



I 
^ 



^ 
^ 



^ 



^ 



^ 
^ 



^ 



I 
^ 



^ 
^ 



^ 



^u— i_n_rar-u 



® 



® 



® ' ® 



® MemConfig connected to inverse of MemnotWrDO 

® Configuration field 1 ; T1 configured for 2 periods Tm 

® Configuration field 2; T2 configured for 3 periods Tm 

® Configuration field 10; most significant bit of notMemS4 configured high 

© Configuration field 1 1 ; refresh interval configured for 36 periods Cioclcln 

® Configuration field 12; refresh enabled 

® Configuration field 13; early write cycle 



Figure 8.12 IMS T800 external configuration scan 
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Table 8.7 IMS T800 external configuration coding 



Scan 
cycle 


MemAD 
address 


Field 


Function 


Example diagram | 


1 


2 


3 


4 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 


7FFFFF6C 
7FFFFF70 
7FFFFF74 
7FFFFF78 
7FFFFF7C 
7FFFFF80 
7FFFFF84 
7FFFFF88 
7FFFFF8C 
7FFFFF90 
7FFFFF94 
7FFFFF98 


1 
1 
2 
2 
3 
3 
4 
4 
5 
5 
6 
6 


TI least significant bit 
TI most significant bit 
T2 least significant bit 
T2 most significant bit 
T3 least significant bit 
T3 most significant bit 
T4 least significant bit 
T4 most significant bit 
T5 least significant bit 
T5 most significant bit 
T6 least significant bit 
T6 most significant bit 





1 


1 







1 









1 















1 







1 







1 



1 







1 




13 
14 
15 
16 
17 


7FFFFF9C 
7FFFFFA0 
7FFFFFA4 
7FFFFFA8 
7FFFFFAC 


7 

7 
7 
7 

7 


notMemSI least significant bit 
notMemSI most significant bit 






1 











1 







1 







18 
19 
20 
21 
22 


7FFFFFB0 
7FFFFFB4 
7FFFFFB8 
7FFFFFBC 
7FFFFFC0 


8 
8 
8 
8 
8 


notMemS2 least significant bit 
notMemS2 most significant bit 


1 

1 








1 













1 

1 
1 





23 
24 
25 
26 
27 


7FFFFFC4 
7FFFFFC8 
7FFFFFCC 
7FFFFFD0 
7FFFFFD4 


9 
9 
9 
9 
9 


notMemS3 least significant bit 
notMemS3 most significant bit 


1 







1 

1 
1 





1 




1 




1 



1 





28 
29 
30 
31 
32 


7FFFFFD8 
7FFFFFDC 
7FFFFFE0 
7FFFFFE4 
7FFFFFE8 


10 
10 
10 
10 
10 


notMemS4 least significant bit 
notMemS4 most significant bit 





1 







1 

1 







1 






1 

1 






33 
34 
35 
36 


7FFFFFEC 
7FFFFFF0 
7FFFFFF4 
7FFFFFF8 


11 
11 
12 
13 


Refresh Interval least significant bit 
Refresh Interval most significant bit 
Refresh Enable 
Late Write 





1 


1 
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Table 8.8 IMS T800 memory refresh configuration coding 



Refresii 


Interval 


Field 11 


Complete 


interval 


in /xs 


encoding 


cycle (mS) 


18 


3.6 


00 


0.922 


36 


7.2 


01 


1.843 


54 


10.8 


10 


2.765 


72 


14.4 


11 


3.686 



Refresh intervals are in periods of Clockln and Clocicln frequency is 5 MHz: 

Interval = 18 * 200 = 3600 ns 

Refresh interval is between successive incremental refresh addresses. 
Complete cycles are shown for 256 row DRAMS. 

Table 8.9 Memory configuration 



SYIVIBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TMCVRdH 
TRdHMCX 
TSOLRdH 


Memory configuration data setup 
Memory configuration data hold 
notMemSO to configuration data read 


25 



a-12 




a+12 


ns 
ns 
ns 


1 



Notes 



1 a is 1 6 periods Tm. 



Tstate i T1 
Tm I ' • 
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MemnotRfDI 
MemAD2--31 
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T2 



"V 



T3 I T4 , T5 

I I I I I I I I i I I 



T6 I I T1 

I I I I I I I I I I 



• ^«««««« o ^S^ 
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X. 
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Figure 8.13 IMS T800 external configuration read cycle timing 
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8.11 



notMemRf 



The IMS T800 can be operated with memory refresh enabled or disabled. The selection is made during 
memory configuration, when the refresh interval is also determined. Refresh cycles do not interrupt internal 
memory accesses, although the internal addresses cannot be reflected on the external bus during refresh. 

When refresh is disabled no refresh cycles occur. During the post-Reset period eight dummy refresh cycles 
will occur with the appropriate timing but with no bus or strobe activity. 

A refresh cycle uses the same basic external memory timing as a normal external memory cycle, except that 
it starts two periods Tm before the start of T1. if a refresh cycle is due during an external memory access, 
it will be delayed until the end of that external cycle. Two extra periods Tm (periods R in the diagram) will 
then be inserted between the end of T6 of the external memory cycle and the start of T1 of the refresh cycle 
itself. The refresh address and various external strobes become active approximately one period Tm before 
T1. Bus signals are active until the end of T2, whilst notMemRf remains active until the end of T6. 

For a refresh cycle, MemnotRfDI goes low before notMemRf goes low and MemnotWrDO goes high with 
the same timing as MemnotRfDI. All the address lines share the same timing, but only MemAD2-11 give 
the refresh address. MemAD12-30 stay high during the address period, whilst MemAD31 remains low. 
Refresh cycles generate strobes notMemSO-4 with timing as for a normal external cycle, but notMemRd and 
notMemWrBO-3 remain high. Mem Wait operates normally during refresh cycles. 

Table 8.10 Memory refresh 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TRfLRfH 

TRaVSOL 

TRfLSOL 


Refresh pulse width low 

Refresh address setup before notMemSO 

Refresh indicator setup before notMemSO 


a-2 
b-12 
b-4 


b 


a+9 
b+6 


ns 
ns 
ns 


1 
2 



Notes 



1 a is total Tmx+Tm. 

2 b is total Tl+Tm where T1 can be from one to four periods Tm in length. 
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Figure 8.14 IMS T800 refresh cycle timing 



8.12 



l\/lemWait 



Taking IVIemWait high with the timing shown will extend the duration of 74. IVlemWait is sampled relative 
to the falling edge of ProcClockOut during a T3 period, and should not change state in this region. By 
convention, noti\/IemS4 is used to synchronize wait state insertion. If this or another strobe is used, its delay 
should be such as to take the strobe low an even number of periods Tm after the start of T1, to coincide with 
a rising edge of ProcClocl<Out. 

IVIemWait may be kept high indefinitely, although if dynamic memory refresh is used it should not be kept 
high long enough to interfere with refresh timing. MemWalt operates normally during all cycles, including 
refresh and configuration cycles. It does not affect internal memory access in any way. 

If the start of T5 would coincide with a falling edge of ProcClockOut an extra wait period Tm (EW) is 
generated by the EMI to force coincidence with a rising edge. Rising edge coincidence is only forced if wait 
states are added, otherwise coincidence with a falling edge is permitted. 
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Table 8.1 1 Memory wait 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPCLWtH 
TPCLWtL 
TWtLWtH 


Wait setup 

Wait hold 

Delay before re-assertion of Walt 


-(0.5Tm+9) 

0.5Tm+10 

2Tm 






ns 
ns 


1.2 
1,2 



Notes 



1 ProcClockOut load should not exceed 50pf. 

2 If wait period exceeds refresh Interval, refresh cycles will be lost. 
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Figure 8.15 IMS T800 memory wait timing 
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8.13 MemReq, MemGranted 

Direct memory access (DMA) can be requested at any time by taking the asynchronous MemReq input high. 
The transputer samples MemReq during the final period Tm of T6 of both refresh and external memory 
cycles. To guarantee taking over the bus immediately following either, MemReq must be set up at least two 
periods Tm before the end of T6. In the absence of an external memory cycle, MemReq is sampled during 
every low period of ProcClockOut. The address bus is tristated two periods Tm after the ProcClockOut 
rising edge which follows the sample. MemGranted is asserted one period Tm after that. 

Removal of MemReq is sampled during each low period of ProcClockOut and MemGranted is removed 
synchronously with the next falling edge of ProcClockOut. If accurate timing of DMA is required, MemReq 
should be set low coincident with a falling edge of ProcClockOut. Further external bus activity, either refresh, 
external cycles or reflection of internal cycles, will commence at the next rising edge of ProcClockOut. 

Strobes are left in their inactive states during DMA. DMA cannot interrupt a refresh or external memory cycle, 
and outstanding refresh cycles will occur before the bus is released to DMA. DMA does not interfere with 
internal memory cycles in any way, although a program running in internal memory would have to wait for 
the end of DMA before accessing external memory. DMA cannot access internal memory. If DMA extends 
longer than one refresh interval (Memory Refresh Configuration Coding, table 8.8), the DMA user becomes 
responsible for refresh. DMA may also inhibit an internally running program from accessing external memory. 

DMA allows a bootstrap program to be loaded into external RAM ready for execution after reset. If MemReq is 
held high throughout reset, MemGranted will be asserted before the bootstrap sequence begins. MemReq 
must be high at least one period TDCLDCL of Clockln before Reset. The circuit should be designed to 
ensure correct operation if Reset could interrupt a normal DMA cycle. 

Table 8.12 Memory request 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TMRHMGH 
TMRLMGL 
TADZMGH 
TMGLADV 


Memory request response time 
Memory request end response time 
Bus tristate before memory granted 
Bus active after end of memory granted 


4Tm-2ns 
2Tm-2ns 
Tm-2ns 

-10ns 




7Tm+7ns 

5Tm+22ns 

Tm+22ns 

Tm+2ns 




1 



Notes 



1 These values assume no external memory cycle is In progress. If an external cycle is active, maximum time 
could be (1 EMI cycle Tmx)+(1 refresh cycle TRfLRfH)+(6 periods Tm), 
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Figure 8.16 IMS T800 memory request timing 
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Figure 8.17 IMS T800 DMA sequence at reset 
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Figure 8.18 IMS T800 operation of MemReq, MemGranted with external, refresh memory cycles 
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Figure 8.19 IMS T800 operation of MemReq, MemGranted with external, internal memory cycles 
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9 Events 

EventReq and EventAck provide an asynchronous handshake interface between an external event and an 
internal process. When an external event takes EventReq high the external event channel (additional to the 
external link channels) is made ready to communicate with a process. When both the event channel and the 
process are ready the processor takes EventAck high and the process, if waiting, is scheduled. EventAck 
is removed after EventReq goes low. 

Only one process may use the event channel at any given time. If no process requires an event to occur 
EventAck will never be taken high. Although EventReq triggers the channel on a transition from low to high, 
it must not be removed before EventAck is high. EventReq should be low during Reset; if not it will be 
ignored until it has gone low and returned high. EventAck is taken low when Reset occurs. 

If the process is a high priority one and no other high priority process is running, the latency is as described 
on page 197. Setting a high priority task to wait for an event input allows the user to interrupt a transputer 
program running at low priority. The time taken from asserting EventReq to the execution of the microcode 
interrupt handler in the CPU is four cycles. The following functions take place during the four cycles: 

Cycle 1 Sample EventReq at pad on the rising edge of ProcClockOut and synchronise. 

Cycle 2 Edge detect the synchronised EventReq and form the interrupt request. 

Cycle 3 Sample interrupt vector for microcode ROM in the CPU. 

Cycle 4 Execute the interrupt routine for Event rather than the next instruction. 

Table 9.1 Event 



SYMBOL 


PARAMETER 


MiN 


NOM 


MAX 


UNITS 


NOTE 


TVHKH 
TKHVL 
TVLKL 
TKLVH 


Event request response 

Event request hold 

Delay before removal of event acknowledge 

Delay before re-assertion of event request 










6Tm+7ns 


ns 
ns 

ns 





EventReq 
EventAck 












/ 

A 


\ 




A 


/ 


TVHKH ->^ 


^ TKHVL ^ 




-*- TVLKL 
. TKLVH . 










V 


^ 


L y^ 















Figure 9.1 IMS T800 event timing 
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Links 



Four identical INMOS bi-directional serial links provide synchronized communication between processors 
and with the outside world. Each link comprises an input channel and output channel. A link between two 
transputers is implemented by connecting a link interface on one transputer to a link interface on the other 
transputer. Every byte of data sent on a link is acknowledged on the input of the same link, thus each signal 
line carries both data and control information. 

The quiescent state of a link output is low. Each data byte is transmitted as a high start bit followed by a one 
bit followed by eight data bits followed by a low stop bit. The least significant bit of data is transmitted first. 
After transmitting a data byte the sender waits for the acknowledge, which consists of a high start bit followed 
by a zero bit. The acknowledge signifies both that a process was able to receive the acknowledged data byte 
and that the receiving link is able to receive another byte. The sending link reschedules the sending process 
only after the acknowledge for the final byte of the message has been received. 

The IMS T800 links allow an acknowledge packet to be sent before the data packet has been fully received. 
This overlapped acknowledge technique is fully compatible with all other INT^OS transputer links. 

The IMS T800 links support the standard INMOS communication speed of 10 Mbits/sec. In addition they can 
be used at 5 or 20 Mbits/sec. Links are not synchronised with Clockln or ProcClockOut and are insensitive 
to their phases. Thus links from independently clocked systems may communicate, providing only that the 
clocks are nominally identical and within specification. 

Links are TTL compatible and intended to be used in electrically quiet environments, between devices on a 
single printed circuit board or between two boards via a backplane. Direct connection may be made between 
devices separated by a distance of less than 300 millimetres. For longer distances a matched 100 ohm 
transmission line should be used with series matching resistors RM. When this is done the line delay should 
be less than 0.4 bit time to ensure that the reflection returns before the next data bit is sent. 

Buffers may be used for very long transmissions. If so, their overall propagation delay should be stable within 
the skew tolerance of the link, although the absolute value of the delay is immaterial. 

Link speeds can be set by LinkSpecial, LinkOSpeclal and Link123Special. The link speed can be set 
independently. Table 10.1 shows uni-directional and bi-directional data rates in Kbytes/sec for each link 
speed; LinknSpecial is to be read as LinkOSpeclal when selecting link speed and as Link123Special for 
the others. Data rates are quoted for a transputer using internal memory, and will be affected by a factor 
depending on the number of external memory accesses and the length of the external memory cycle. 

Table 10.1 Speed Settings for Transputer Links 



Link 
Special 


Linkn 
Special 


Mbits/sec 


Kbytes/sec 


Uni 


Bi 




1 
1 



1 


1 


10 
5 
10 
20 


910 

450 

910 

1740 


1250 

670 

1250 

2350 





H L, 
1 Ack 1 


H'H 01234567 L, 


1 Data 1 



Figure 10.1 IMS T800 link data and acknowledge packets 
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Table 10.2 Link 



SYIVIBOL 


PARAMETER 


MiN 


NOM 


MAX 


UNITS 


NOTE 


TJQr 


LinkOut rise time 






20 


ns 


1 


TJQf 


LinkOut fall time 






10 


ns 


1 


TJDr 


Linkin rise time 






20 


ns 


1 


TJDf 


Linkin fall time 






20 


ns 


1 


TJQJD 


Buffered edge delay 









ns 




TJBskew 


Variation in TJQJD 20 Mbits/s 






3 


ns 


2 




10Mbits/s 






10 


ns 


2 




5 Mbits/s 






30 


ns 


2 


CLIZ 


Linkin capacitance @ f=1MHz 






7 


PF 


1 


CLL 


LinkOut load capacitance 






50 


PF 




RM 


Series resistor for 1 0On transmission line 




56 




ohms 





Notes 



1 These paramters are sampled, but are not 100% tested. 

2 This is the variation in the total delay through buffers, transmission lines, differential receivers etc., caused by 
such things as short term variation in supply voltages and differences in delays for rising and falling edges. 




Figure 10.2 IMS T800 link timing 
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Figure 10.3 IMS T800 buffered link timing 
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Transputer family device A 


LinkOut 
Linkln 


,^ 


Linkln 
LinkOut 










Transputer family device B 



Figure 10.4 IMS T800 Links directly connected 



Transputer family device A 



LinkOut 
Linkln 



3- 
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Zo=100ohms 
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Zo=100ohms 



RM 



Linkln 
LinkOut 



Transputer family device B 



Figure 10.5 IMS T800 Links connected by transmission line 
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Figure 10.6 IMS T800 Links connected by buffers 
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11.1 DC electrical characteristics 



Table 11.1 Absolute maximum ratings 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


vcc 


DC supply voltage 





7.0 


V 


1.2,3 


VI, vo 


Voltage on input and output pins 


-0.5 


VCC+0.5 


V 


1.2,3 


II 


Input current 




±25 


mA 


4 


OSCT 


Output short circuit time (one pin) 




1 


s 


2 


TS 


Storage temperature 


-65 


150 


°C 


2 


TA 


Ambient temperature under bias 


-55 


125 


°C 


2 


PDmax 


Maximum allowable dissipation 




2 


W 





Notes 



1 Ail voltages are with respect to GND. 

2 This is a stress rating only and functional operation of the device at these or any other conditions beyond those 
indicated in the operating sections of this specification is not implied. Stresses greater than those listed may 
cause permanent damage to the device. Exposure to absolute maximum rating conditions for extended periods 
may affect reliability. 

3 This device contains circuitry to protect the inputs against damage caused by high static voltages or electrical 
fields. However, it is advised that normal precautions be taken to avoid application of any voltage higher than the 
absolute maximum rated voltages to this high impedance circuit. Unused Inputs should be tied to an appropriate 
logic level such as VCC or GND. 

4 The input current applies to any input or output pin and applies when the voltage on the pin is between GND 
and vcc. 



Table 1 1 .2 Operating conditions 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VCC 


DC supply voltage 


4.75 


5.25 


V 


1 


VI, VO 


Input or output voltage 





VCC 


V 


1,2 


CL 


Load capacitance on any pin 




60 


PF 




TA 


Operating temperature range IMS T800-S 





70 


°C 


3 


TA 


Operating temperature range IMS T800-M 


-55 


125 


°C 


3 



Notes 



1 All voltages are with respect to GND. 

2 Excursions beyond the supplies are permitted but not recommended; see DC characteristics. 

3 Air flow rate 400 linear ft/min transverse air flow. 
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Table 11 .3 DC characteristics 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VIH 


High level input voltage 


2.0 


VCC+0.5 


V 


1,2 


VIL 


Low level input voltage 


-0.5 


0.8 


V 


1,2 


il 


Input current @ GND<VI<VCC 




±10 


/iA 


1,2 


VOH 


Output high voltage @ IOH=2mA 


VCC-1 




V 


1,2 


VOL 


Output low voltage @ IOL=4mA 




0.4 


V 


1,2 


lOS 


Output short circuit current @ GND<VO<VCC 


36 


65 


mA 


1,2,3,6 






65 


100 


mA 


1,2.4,6 


lOZ 


Tristate output current @ GND<VO<VCC 




±10 


/iA 


1,2 


PD 


Power dissipation 




1.2 


W 


2,5 


CIN 


Input capacitance @ f=1MHz 




7 


PF 


6 


COZ 


Output capacitance @ f=i MHz 




10 


PF 


6 



Notes 



1 AH voltages are with respect to GND. 

2 Parameters for IMS T800-S measured at 4.75V<VCC<5.25V and 0°C<TA<70°C. 
Input clock frequency = 5 MHz. 

3 Current sourced from non-link outputs. 

4 Current sourced from link outputs. 

5 Power dissipation varies with output loading and program execution. 
Power dissipation for processor operating at 20 MHz. 

6 This parameter is sampled and not 100% tested. 



11.2 Equivalent circuits 



VCC — J— 

RlN 

Oiitniit r - 1-^ 1 










Load for: 


R1 


R2 


Equivalent load: 




Link outputs 
Other outputs 


1K96 
970R 


47K 
24K 


1 Schottky TTL input 

2 Schottky TTL inputs 


50pF = 

HMn 


1^ T 


Diodes are 1N916 



















Figure 11.1 Load circuit for AC measurements 
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Vdd-1 \ 
Inputs VIH 

OV 

Vdd-1 
Inputs VIL 
OV ^ 


\ 




/ 

tpHL 




*< j^ 


Vdd 
Outputs 

OV 


\l.5V 


tpLH 




Vdd 
Outputs 

OV 


yM.5V 





Figure 11.2 AC measurements timing waveforms 



Test point 
Output under test 



X" 




vcc 



510R 



50pFUp 
GND ■ 1— 



Figure 11.3 Tristate load circuit for AC measurements 
11.3 AC timing characteristics 

Table 1 1 .4 Input, output edges 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


TDr 


Input rising edges 


2 


20 


ns 


1,2 


TDf 


Input falling edges 


2 


20 


ns 


1,2 


TQr 


Output rising edges 




25 


ns 


1 


TQf 


Output falling edges 




15 


ns 


1 


TSOLaHZ 


Address high to tristate 


a 


a+6 


ns 


3 


TSOLaLZ 


Address low to tristate 


a 


a+6 


ns 


3 



Notes 



1 Non-link pins; see section on links. 

2 All inputs except Clockln; see section on Clockln. 

3 a is T2 where T2 can be from one to four periods Tm in length. 
Address lines include MemnotWrDO, MemnotRfDI, MemAD2-31. 
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Figure 11.4 IMS T800 input and output edge timing 
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Figure 11.5 IMS T800 tristate timing relative to notMemSO 
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Load Capacitance pF 
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Notes 



Figure 11.6 Typical rise/fall times 



1 Skew is measured between notiVlemSO with a standard load (2 Schottky TTL inputs and 30pF) and 
notl\flemSO with a load of 2 Schottky TTL inputs and varying capacitance. 
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11.4 Power rating 

Internal power dissipation Pint of transputer and peripheral chips depends on VCC, as shown in figure 1 1 .7. 
Pint is substantially independent of temperature. 

Total power dissipation Pd of the chip is 

Pd - Pint + Pio 
where Pio is the power dissipation in the input and output pins; this is application dependent. 
Internal working temperature Tj of the chip is 

Tj^TA-^ejA*PD 

where Ta is the external ambient temperature in °C and SJa is the junction-to-ambient thermal resistance in 
°C/W. OJa for each package is given in the Packaging Specifications section. 



800- 

700- 

Power 

PINT 600 H 

mW 

500- 




T800-25 

T800-20 
T800-17 



I I I \ I I \ I I I I I I p 
4.4 4.6 4.8 5.0 5.2 5.4 5.6 
VCC Volts 



Figure 1 1 .7 IMS T800 internal power dissipation vs VCC 




Figure 11.8 IMS T800 typical power dissipation with processor speed 
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Performance 



The performance of the transputer is measured in terms of the number of bytes required for the program, and 
the number of (internal) processor cycles required to execute the program. The figures here relate to OCCam 
programs. For the same function, other languages should achieve approximately the same performance as 
Occam. 

With transputers incorporating an FPU, this type of performance calculation is straight forward when consider- 
ing only integer data types. However, when floating point calculations using the REAL32 and REAL64 data 
types are present in the program, complications arise due to the concurrency inherent in the transputer's de- 
sign whereby integer calculations can be overlapped with floating point calculations. A more comprehensive 
guide to the impact of this concurrency on transputer performance can be found in the Transputer Instruction 
Set - A Compiler Writers' Guide. 



12.1 



Performance overview 



These figures are averages obtained from detailed simulation, and should be used only as an initial guide; 
they assume operands are of type int. The abbreviations in table 12.1 are used to represent the quantities 
indicated. In the replicator section of the table, figures in braces {} are not necessary if the number of 
replications is a compile time constant. To estimate performance, add together the time for the variable 
references and the time for the operation. 



Table 12.1 Key to performance table 



np 

ne 

r 

ts 

w 

P 

Eg 

Et 

Tb 

Tbp 

Tbc 

nsp 

nap 



number of component processes 

number of processes earlier in queue 

1 if INT parameter or array parameter, if not 

number of table entries (table size) 

width of constant in nibbles 

number of places to shift 

expression used in a guard 

timer expression used in a guard 

most significant bit set of multiplier ((-1) if the multiplier is 0) 

most significant bit set in a positive multiplier when counting from zero ((-1) if the multiplier is 0) 

most significant bit set in the two's complement of a negative multiplier 

Number of scalar parameters in a procedure 

Number of array parameters in a procedure 
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Table 12.2 Performance 





Size (bytes) 


Time (cycles) 


Names 

variables 

in expression 
assigned to or input to 
in PRQC or FUNCTION call, 
corresponding to an INT parameter 

channels 


1.1+r 
1.1+r 

1.1+r 
1.1 


2.1+2(r) 
1.1 +(r) 

1.1 +(r) 
2.1 


Array Variables (for single dimension arrays) 
constant subscript 
variable subscript 
expression subscript 




5.3 
5.3 




7.3 
7.3 


Declarations 

CHAN OF protocol 
[size] CHAN OF protocol 
PROC 


3.1 

9.4 

body+2 


3.1 

2.2 + 20.2*size 




Primitives 

assignment 

input 

output 

STOP 

SKIP 



4 
1 
2 





26.5 

26 

25 




Arithmetic operators 

+ 
* 

/ 

REM 

» « 


1 
2 
2 
2 
2 


1 

39 
40 
38 
3+p 


Modulo Arithmetic operators 
PLUS 
MINUS 

TIMES (fast multiply, positive operand) 
TIMES (fast multiply, negative operand) 


2 

1 
1 
1 


2 

1 
4+Tbp 
5+Tbc 


Boolean operators 

OR 

AND NOT 


4 
1 


8 
2 


Comparison operators 
= constant 
= variable 
<> constant 
<> variable 
> < 
>= <= 



2 
1 
3 
1 
2 


1 
3 
3 
5 
2 
4 


Bit operators 

/\ V X - 


2 


2 


Expressions 

constant in expression 
check if error 


w 

4 


w 

6 
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Table 12.3 Performance 





Size (bytes) 


Time (cycles) 


Timers 






timer input 


2 


3 


timer AFTER 






if past time 


2 


4 


with empty timer queue 


2 


31 


non-empty timer queue 


2 


38+ne*9 


ALT (timer) 






with empty timer queue 


6 


52 


non-empty timer queue 


6 


59+ne*9 


timer alt guard 


8+2Eg+2Et 


34+2Eg+2Et 


Constructs 






SEQ 








IF 


1.3 


1.4 


if guard 


3 


4.3 


ALT (non timer) 


6 


26 


alt channel guard 


10.2+2Eg 


20+2Eg 


skip alt guard 


8+2Eg 


10+2Eg 


PAR 


11.5+(np-1)*7.5 


19.5+(np-1)*30.5 


WHILE 


4 


12 


Procedure or function call 








3.5+(nsp-2)*1.1 


16.5+(nsp-2)*1.1 




+nap*2.3 


+nap*2.3 


Replicators 






replicated SEQ 


7.3{+5.1} 


(-3.8)+15.1*count{+7.1} 


replicated if 


12.3{+5.1} 


(-2.6)+19.4*count{+7.1} 


replicated ALT 


24.8{+10.2} 


25.4+33.4*count{+14.2} 


replicated timer ALT 


24.8{+10.2} 


62.4+33.4*count{+14.2} 


replicated par 


39.1 {+5.1} 


(-6.4)+70.9*COunt{+7.1} 



12.2 Fast multiply, TIMES 

The IMS T800 has a fast integer multiplication instruction product For a positive multiplier its execution time 
is 4+Tbp cycles, and for a negative multiplier 5+Tbc cycles (table 12.1). The time taken for a multiplication 
by zero is 3 cycles. 

Implementations of high level languages on the transputer may take advantage of this instruction. For example, 
the Occam modulo arithmetic operator TIMES is implemented by the instruction and the right-hand operand is 
treated as the multiplier. The fast multiplication instruction is also used in high level language implementations 
for the multiplication implicit in multi-dimensional array access. 
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12.3 



Arithmetic 



A set of functions are provided within the development system to support the efficient implementation of 
multiple length integer arithmetic. In the IMS T800, floating point arithmetic is taken care of by the FPU. In 
table 12.4 n gives the number of places shifted and all arguments and results are assumed to be local. Full 
details of these functions are provided in the Occam reference manual, supplied as part of the development 
system and available as a separate publication. 

When calculating the execution time of the predefined maths functions, no time needs to be added for calling 
overhead. These functions are compiled directly into special purpose instructions which are designed to 
support the efficient implementation of multiple length integer arithmetic and floating point arithmetic. 

Table 12.4 Arithmetic performance 









+ cycles for 




Function 




Cycles 


parameter access 


J_ 


LONGADD 




2 


7 




L0N6SUM 




3 


8 




LONGSUB 




2 


7 




LONGDIFF 




3 


8 




LONGPROD 




34 


8 




LONGDIV 




36 


8 




SHIFTRIGHT 


(n<32) 
(n>=32) 


4+n 
n-27 


8 




SHIFTLEFT 


(n<32) 
(n>=32) 


4+n 
n-27 


8 




NORMALISE 


(n<32) 

(n>=32) 

(n=64) 


n+6 
n-25 

4 


7 




ASHIFTRIGHT 




SHIFTRIGHT+2 


5 




ASHIFTT.F.FT 




SHIFTLF.FT+4 


5 




ROTATERIGHT 




SHIFTRIGHT 


7 




ROTATELEFT 




SHIFTLEFT 


7 




FRACMUL 




LONGPROD+4 


5 





t Assuming local variables. 
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12.4 Floating point operations 

All references to REAL32 or REAL64 operands within programs compiled for the IMS T800 normally produce 
the following performance figures. 

Table 12.5 Floating point performance 





Size (bytes) 


REAL32 Time (cycles) 


REAL64 Time (cycles) 


Names 








variables 








in expression 


3.1 


3 


5 


assigned to or input to 


3.1 


3 


5 


in PROC or FUNCTION call, 








corresponding to a REAL 








parameter 


1.1+r 


1.1+r 


1.1+r 


Arithmetic operators 








+ 


2 


7 


7 


* 


2 


11 


20 


/ 


2 


17 


32 


REM 


11 


19 


34 


Comparison operators 








= 


2 


4 


4 


<> 


3 


6 


6 


> < 


2 


5 


5 


>= <= 


3 


7 


7 


Conversions 








REAL32 to - 


2 




3 


REAL64 to - 


2 


6 




To INT32 from - 


5 


9 


9 


To INT64 from - 


18 


32 


32 


INT32 to - 


3 


7 


7 


INT64 to - 


14 


24 


22 



12.4.1 Floating point functions 

These functions are provided by the development system. They are compiled directly into special purpose 
instructions designed to support the efficient implementation of some of the common mathematical functions 
of other languages. The functions provide ABS and SQRT for both REAL32 and REAL64 operand types. 

Table 12.6 IMS T800 floating point arithmetic performance 



Function 


Cycles 


+ cycles for parameter access f 


REAL32 


REAL 64 


ABS 
SQRT 
DABS 
DSQRT 


2 

118 
2 

244 


8 
8 


12 
12 



t Assuming local variables. 
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12.4.2 Special purpose functions and procedures 

The functions and procedures given in tables 12.8 and 12.9 are provided by the development system to give 
access to the special instructions available on the IMS T800. Table 12.7 shows the key to the table. 



Table 12.7 Key to special performance table 



Tb 
n 
r 
nr 



most significant bit set in the word counting from zero 
number of words per row (consecutive memory locations) 
number of rows in the two dimensional move 
number of bits to reverse 



Table 12.8 Special purpose functions performance 



Function 


Cycles 


+ cycles for 
parameter access f 


BITCOUNT 


2+Tb 


2 


CRCBYTE 


11 


8 


CRCWORD 


35 


8 


BITREVNBIT 


5+nr 


4 


BITREVWORD 


36 


2 



t Assuming local variables. 



Table 12.9 Special purpose procedures performance 



Procedure 


Cycles 


+ cycles for 
parameter access f 


M0VE2D 
DRAW2D 
CLIP2D 


8+(2n+23)*r 
8+(2n+23)*r 
8+(2n+23)*r 


8 
8 
8 



t Assuming local variables. 



12.5 Effect of external memory 

Extra processor cycles may be needed when program and/or data are held in external memory, depending 
both on the operation being performed, and on the speed of the external memory. After a processor cycle 
which initiates a write to memory, the processor continues execution at full speed until at least the next 
memory access. 

Whilst a reasonable estimate may be made of the effect of external memory, the actual performance will 
depend upon the exact nature of the given sequence of operations. 

External memory is characterized by the number of extra processor cycles per external memory cycle, denoted 
as e. For the IMS T800, with the fastest external memory the value of e is 2; a typical value for a large external 
memory is 5. 

If a program is stored in external memory, and e has the value 2 or 3, then no extra cycles need be estimated 
for linear code sequences. For larger values of e, the number of extra cycles required for linear code 
sequences may be estimated at (e-3)/4. A transfer of control may be estimated as requiring e+3 cycles. 

These estimates may be refined for various constructs. In table 12.10 n denotes the number of components 
in a construct. In the case of IF, the n'th conditional is the first to evaluate to TRUE, and the costs include the 
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costs of the conditionals tested. The number of bytes in an array assignment or communication is denoted 
byb. 

Table 12.10 External memory performance 





IMS T800 


Program off !:hlp 


Data off chip 


Boolean expressions 


e-2 





IF 


3en-8 


en 


Replicated IF 


(6e-4)n+7 


(5e-2)n+8 


Replicated SEQ 


(3e-3)n+2 


(4e-2)n 


PAR 


(3e-1)n+8 


3en+4 


Replicated PAR 


(10e-8)n+8 


16en-12 


ALT 


(2e-4)n+6e 


(2e-2)n+10e-8 


Array assignment and 





max (2e. e(b/2)) 


communication in 






one transputer 







The following simulation results illustrate the effect of storing program and/or data in external memory. The 
results are normalized to 1 for both program and data on chip. The first program (Sieve of Erastosthenes) 
is an extreme case as it is dominated by small, data access intensive loops; it contains no concurrency, 
communication, or even multiplication or division. The second program is the pipeline algorithm for Newton 
Raphson square root computation. 

Table 12.11 IMS T800 external memory performance 





Program 


8=2 


e=3 


9=4 


e=5 


On chip 


Program off chip 


1 
2 


1.3 
1.1 


1.5 
1.2 


1.7 
1.2 


1.9 

1.3 




Data off chip 


1 
2 


1.5 
1.2 


1.8 
1.4 


2.1 
1.6 


2.3 

1.7 




Program and data off chip 


1 
2 


1.8 
1.3 


2.2 

1.6 


2.7 
1.8 


3.2 
2.0 





12.6 Interrupt latency 

If the process is a high priority one and no other high priority process is running, the latency is as described 
in table 12.12. The timings given are in full processor cycles TPCLPCL; the number of Tm states is also 
given where relevant. Maximum latency assumes all memory accesses are internal ones. 



Table 12.12 Interrupt latency 





Typical 


Maximum 


TPCLPCL 


Tm 


TPCLPCL 


Tm 


IMS T800 with FPU in use 
IMS T800 with FPU not in use 


19 
19 


38 
38 


78 
58 


156 
116 
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13 Package specifications 
13.1 84 pin grid array package 
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Figure 13.1 IMS T800 84 pin grid array package pinout 
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Figure 13.2 84 pin grid array package dimensions 



Table 13.1 84 pin grid array package dimensions 



DIIVI 


l\/lillimetres 


Inches 


Notes 


NOIVI 


TOL 


NOM 


TOL 


A 


26.924 


±0.254 


1.060 


±0.010 




B1 


17.019 


±0.127 


0.670 


±0.005 




82 


18.796 


±0.127 


0.740 


±0.005 




C 


2.456 


±0.278 


0.097 


±0.011 




D 


4.572 


±0.127 


0.180 


±0.005 




E 


3.302 


±0.127 


0.130 


±0.005 




F 


0.457 


±0.025 


0.018 


±0.002 


Pin diameter 


G 


1.143 


±0.127 


0.045 


±0.005 


Flange diameter 


K 


22.860 


±0.127 


0.900 


±0.005 




L 


2.540 


±0.127 


0.100 


±0.005 




M 


0.508 




0.020 




Chamfer 



Package weight is approximately 7.2 grams 

Table 13.2 84 pin grid array package junction to ambient thermal resistance 



SYMBOL 


PARAiVIETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


^JA 


At 400 linear ft/min transverse air flow 






35 


°c/w 
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13.2 84 lead quad cerpack package 

The leads are unformed to allow the user to form them to specific requirements. 
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Figure 13.3 IMS T800 84 lead quad cerpack package pinout 
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Figure 13.4 84 lead quad cerpack package dimensions 



DIM 


Millimetres 


Inches 


Notes 


NOM 


TOL 


NOM 


TOL 


A 


38.100 


±0.508 


1.500 


±0.020 




B 


26.924 


±0.305 


1.060 


±0.012 




C 


20.574 


±0.203 


0.810 


±0.008 




D 


19.558 


±0.254 


0.770 


±0.010 




E 


0.508 




0.020 






F 


1.270 


±0.051 


0.050 


±0.002 




G 


2.489 


±0.305 


0.098 


±0.012 




H 


0.635 


±0.076 


0.025 


±0.003 




J 


1.143 


±0.102 


0.045 


±0.004 




K 


3.099 




0.122 




Max. 


L 


27.940 




1.100 




Max. 


M 


0.178 


±0.025 


0.007 


±0.001 





Table 13.3 84 lead quad cerpack package dimensions 
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14 Ordering 

This section indicates the designation of speed and package selections for the various devices. Speed of 
Clockln is 5 MHz for all parts. Transputer processor cycle time is nominal; it can be calculated more exactly 
using the phase lock loop factor PLLx, as detailed in the external memory section. 

For availability contact local INMOS sales office or authorised distributor. 

Table 14.1 IMS T800 ordering details 



INMOS 
designation 


Processor 
clock speed 


Processor 
cycle time 


PLLx 


Package 


IMS T800-G17S 
IMS T800-G20S 
IMS T800-G25S 
IMS T800-G30S 


17.5 MHz 
20.0 MHz 
25.0 MHz 
30.0 MHz 


57 ns 
50 ns 
40 ns 
33 ns 


3.5 
4.0 
5.0 
6.0 


Ceramic Pin Grid 
Ceramic Pin Grid 
Ceramic Pin Grid 
Ceramic Pin Grid 


IMS T800-G17M 
IMS T800-G20M 


17.5 MHz 
20.0 MHz 


57 ns 
50 ns 


3.5 
4.0 


Ceramic Pin Grid MIL Spec 
Ceramic Pin Grid MIL Spec 


IMS T800-Q17M 
IMS T800-Q20M 


17.5 MHz 
20.0 MHz 


57 ns 
50 ns 


3.5 
4.0 


Quad Cerpack MIL Spec 
Quad Cerpack MIL Spec 



The timing parameters in this datasheet are based on full characterisation of the 17 MHz and 20 MHz 
parts. Data for higher speeds is based on tests on a limited number of samples and may change when full 
characterisation is completed. 
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IMS T425 

transputer 



DDDmos 



Advance Data 



FEATURES 

32 bit architecture 

33 ns internal cycle time 
30 MIPS (peak) instruction rate 
Pin compatible with IMS T805, IMS T800 and IMS T414 
Debugging support 

4 Kbytes on-chip static RAM 

120 Mbytes/sec sustained data rate to internal memory 

4 Gbytes directly addressable external memory 

40 Mbytes/sec sustained data rate to external memory 

630 ns response to interrupts 

Four INMOS serial links 5/10/20 Mbits/sec 

High performance graphics support with block move 

instructions 

Boot from ROM or communication links 

Single 5 MHz clock input 

Single -»-5V ±5% power supply 

MIL-STD-883C processing will be available 

APPLICATIONS 

High speed multi processor systems 

High performance graphics processing 

Supercomputers 

Workstations and workstation clusters 

Digital signal processing 

Accelerator processors 

Distributed databases 

System simulation 

Telecommunications 

Robotics 

Fault tolerant systems 

Image processing 

Pattern recognition 

Artificial intelligence 
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1 Introduction 

The IMS T425 transputer is a 32 bit CMOS microcomputer witii grapliics support. It has 4 Kbytes on-chip 
RAM for high speed processing, a configurable memory interface and four standard INMOS communication 
links. The instruction set achieves efficient implementation of high level languages and provides direct support 
for the OCCann model of concurrency when using either a single transputer or a network. Procedure calls, 
process switching and typical interrupt latency are sub-microsecond. 

For convenience of description, the IMS T425 operation is split into the basic blocks shown in figure 1 .1 . 
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Figure 1.1 IMS T425 block diagram 



The processor speed of a device can be pin-selected in stages from 17.5 MHz up to the maximum allowed 
for the part. A device running at 30 MHz achieves an instruction throughput of 30 MIPS peak and 15 MIPS 
sustained. The extended temperature version of the device complies with MIL-STD-883C. 

High performance graphics support is provided by microcoded block move instructions which operate at the 
speed of memory. The two-dimensional block move instructions provide for contiguous block moves as well 
as block copying of either non-zero bytes of data only or zero bytes only. Block move instructions can be used 
to provide graphics operations such as text manipulation, windowing, panning, scrolling and screen updating. 

Cyclic redundancy checking (CRC) instructions are available for use on arbitrary length serial data streams, 
to provide error detection where data integrity is critical. Another feature of the IMS T425, useful for pattern 
recognition, is the facility to count bits set in a word. 
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The IMS T425 can directly access a linear address space of 4 Gbytes. The 32 bit wide memory interface 
uses multiplexed data and address lines and provides a data rate of up to 4 bytes every 100 nanoseconds 
(40 Mbytes/sec) for a 30 MHz device. A configurable memory controller provides all timing, control and DRAM 
refresh signals for a wide variety of mixed memory systems. 

System Services include processor reset and bootstrap control, together with facilities for error analysis. Error 
signals may be daisy-chained in multi-transputer systems. 

The standard INMOS communication links allow networks of transputer family products to be constructed by 
direct point to point connections with no external logic. The IMS T425 links support the standard operating 
speed of 1 Mbits/sec, but also operate at 5 or 20 Mbits/sec. Each link can transfer data bi-directionally at 
up to 2.35 Mbytes/sec. 

The IMS T425 is pin compatible with the IMS T800 and can be plugged directly into a circuit designed for 
that device. It has a number of additions to improve hardware interfacing and to facilitate software initialising 
and debugging. The improvements have been made in an upwards-compatible manner. Software should be 
recompiled, although no changes to the source code are necessary. 

The IMS T425-20 is also pin compatible with the IMS T414-20, as the extra inputs used are all held to ground 
on the IMS T414. The IMS T425-20 can thus be plugged directly into a circuit designed for a 20 MHz version 
of the IMST414. 

The transputer is designed to implement the OCCam language, detailed in the OCcam Reference Manual, but 
also efficiently supports other languages such as C, Pascal and Fortran. Access to the transputer at machine 
level is seldom required, but if necessary refer to the Transputer Instruction Set - A Compiler Writers' Guide. 
The instruction set of the IMS T425 is the same as that of the IMS T800, except that the IMS T800 floating 
point instructions are replaced by the IMS T41 4 floating point support instructions. 

This data sheet supplies hardware implementation and characterisation details for the IMS T425. It is intended 
to be read in conjunction with the Transputer Architecture chapter, which details the architecture of the 
transputer and gives an overview of occann. 
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Pin designations 



Table 2.1 IMS T425 system services 



Pin 


In/Out 


Function 


VCC, GND 




Power supply and return 


CapPfus, CapWinus 




External capacitor for internal clock power supply 


Clocldn 


in 


Input clock 


ProcSpeedSeiectO-2 


in 


Processor speed selectors 


Reset 


in 


System reset 


Error 


out 


Error indicator 


Errorin 


in 


Error daisychain input 


Analyse 


in 


Error analysis 


BootFromRom 


in 


Boot from external ROM or from link 


DisableintRAIVI 


in 


Disable internal RAM 



Table 2.2 IMS T425 external memory interface 



Pin 


In/Out 


Function 


ProcClockOut 


out 


Processor clock 


MemnotWrDO 


in/out 


Multiplexed data bit and write cycle warning 


MemnotRfDI 


in/out 


Multiplexed data bit 1 and refresh warning 


MemAD2-31 


in/out 


Multiplexed data and address bus 


notlVlemRd 


out 


Read strobe 


notlVlemWrBO-3 


out 


Four byte-addressing write strobes 


notMemSO-4 


out 


Five general purpose strobes 


notMemRf 


out 


Dynamic memory refresh indicator 


RefreshPending 


out 


Dynamic refresh is pending 


MemWait 


in 


Memory cycle extender 


MemReq 


in 


Direct memory access request 


IVIemG ranted 


out 


Direct memory access granted 


MemConfig 


in 


Memory configuration data input 



Table 2.3 IMS T425 event 



Pin 


In/Out 


Function 


EventReq 
EventAck 
EventWaiting 


in 

out 
out 


Event request 

Event request acknowledge 

Event input requested by software 



Table 2.4 IMS T425 link 



Pin 


In/Out 


Function 


LinklnO-3 


in 


Four serial data input channels 


LinkOutO-3 


out 


Four serial data output channels 


LinkSpecial 


in 


Select non-standard speed as 5 or 20 Mbits/sec 


LinkOSpecial 


in 


Select special speed for Link 


Link123Special 


in 


Select special speed for Links 1 ,2,3 



Signal names are prefixed by not if they are active low, otherwise they are active high. 
Pinout details for various packages are given on page 326. 
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3 Processor 

The 32 bit processor contains instruction processing logic, instruction and work pointers, and an operand 
register. It directly accesses the high speed 4 Kbyte on-chip memory, which can store data or program. 
Where larger amounts of memory or programs in ROM are required, the processor has access to 4 Gbytes 
of memory via the External Memory Interface (EMI). 



3.1 



Registers 



The design of the transputer processor exploits the availability of fast on-chip memory by having only a small 
number of registers; six registers are used in the execution of a sequential process. The small number of 
registers, together with the simplicity of the instruction set, enables the processor to have relatively simple 
(and fast) data-paths and control logic. The six registers are: 

The workspace pointer which points to an area of store where local variables are kept. 

The instruction pointer which points to the next instruction to be executed. 

The operand register which is used in the formation of instruction operands. 

The A, B and C registers which form an evaluation stack. 

A, B and C are sources and destinations for most arithmetic and logical operations. Loading a value into the 
stack pushes B into C, and A into B, before loading A. Storing a value from A, pops B into A and C into B. 

Expressions are evaluated on the evaluation stack, and instructions refer to the stack implicitly. For example, 
the aofof instruction adds the top two values in the stack and places the result on the top of the stack. The use of 
a stack removes the need for instructions to respecify the location of their operands. Statistics gathered from a 
large number of programs show that three registers provide an effective balance between code compactness 
and implementation complexity. 

No hardware mechanism is provided to detect that more than three values have been loaded onto the stack. 
It is easy for the compiler to ensure that this never happens. 

Any location in memory can be accessed relative to the workpointer register, enabling the workspace to be 
of any size. 

Further register details are given in Transputer Instruction Set - A Compiler Writers' Guide. 
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Figure 3.1 Registers 
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3.2 



Instructions 



The instruction set has been designed for simple and efficient compilation of high-level languages. All in- 
structions have the same format, designed to give a compact representation of the operations occurring most 
frequently in programs. 

Each instruction consists of a single byte divided into two 4-bit parts. The four most significant bits of the byte 
are a function code and the four least significant bits are a data value. 





Function Data 


7 4 3 

1 ' 


Operand Register 





Figure 3.2 Instruction format 



3.2.1 



Direct functions 



The representation provides for sixteen functions, each with a data value ranging from to 15. Ten of these, 
shown in table 3.1 , are used to encode the most important functions. 

Table 3.1 Direct functions 



load constant 


add constant 




load local 


store local 


load local pointer 


load non-local 


store non-local 




Jump 


conditional Jump 


call 



The most common operations in a program are the loading of small literal values and the loading and storing 
of one of a small number of variables. The load constant instruction enables values between and 15 to be 
loaded with a single byte instruction. The load local an6 store /oca/ instructions access locations in memory 
relative to the workspace pointer. The first 16 locations can be accessed using a single byte instruction. 

The load non-local and store non-local instructions behave similarly, except that they access locations in 
memory relative to the A register. Compact sequences of these instructions allow efficient access to data 
structures, and provide for simple implementations of the static links or displays used in the implementation 
of high level programming languages such as OCCam, C, Fortran, Pascal or ADA. 

3.2.2 Prefix functions 



Two more function codes allow the operand of any instruction to be extended in length; prefix and negative 
prefix. 

All instructions are executed by loading the four data bits into the least significant four bits of the operand 
register, which is then used as the instruction's operand. All instructions except the prefix instructions end by 
clearing the operand register, ready for the next instruction. 

The prefix instruction loads its four data bits into the operand register and then shifts the operand register up 
four places. The negative prefix instruction is similar, except that it complements the operand register before 
shifting it up. Consequently operands can be extended to any length up to the length of the operand register 
by a sequence of prefix instructions. In particular, operands in the range -256 to 255 can be represented 
using one prefix instruction. 
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The use of prefix instructions has certain beneficial consequences. Firstly, they are decoded and executed 
in the same way as every other instruction, which simplifies and speeds instruction decoding. Secondly, they 
simplify language compilation by providing a completely uniform way of allowing any instruction to take an 
operand of any size. Thirdly, they allow operands to be represented in a form independent of the processor 
wordlength. 

3.2.3 Indirect functions 

The remaining function code, operate, causes its operand to be interpreted as an operation on the values 
held in the evaluation stack. This allows up to 16 such operations to be encoded in a single byte instruction. 
However, the prefix instructions can be used to extend the operand of an operate instruction just like any 
other. The instruction representation therefore provides for an indefinite number of operations. 

Encoding of the indirect functions is chosen so that the most frequently occurring operations are represented 
without the use of a prefix instruction. These include arithmetic, logical and comparison operations such as 
add, exclusive or and greater than. Less frequently occurring operations have encodings which require a 
single prefix operation. 

3.2.4 Expression evaluation 

Evaluation of expressions sometimes requires use of temporary variables in the workspace, but the number 
of these can be minimised by careful choice of the evaluation order. 

Table 3.2 Expression evaluation 



Program 


Mnemonic 


X :=0 


Idc 

StI X 


X := #24 


pfix 2 
Idc 4 

StI X 


X := y + z 


Idl y 
Idl z 
add 

StI X 



3.2.5 Efficiency of encoding 

Measurements show that about 70% of executed instructions are encoded in a single byte; that is, without 
the use of prefix instructions. Many of these instructions, such as load constant and add require Just one 
processor cycle. 

The instruction representation gives a more compact representation of high level language programs than 
more conventional instruction sets. Since a program requires less store to represent it, less of the memory 
bandwidth is taken up with fetching instructions. Furthermore, as memory is word accessed the processor 
will receive four instructions for every fetch. 

Short instructions also improve the effectiveness of instruction pre-fetch, which in turn improves processor 
performance. There is an extra word of pre-fetch buffer, so the processor rarely has to wait for an instruction 
fetch before proceeding. Since the buffer is short, there is little time penalty when a jump instruction causes 
the buffer contents to be discarded. 
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3.3 



Processes and concurrency 



A process starts, performs a number of actions, and then either stops without completing or terminates 
complete. Typically, a process is a sequence of instructions. A transputer can run several processes in 
parallel (concurrently). Processes may be assigned either high or low priority, and there may be any number 
of each (page 269). 

The processor has a microcoded scheduler which enables any number of concurrent processes to be exe- 
cuted together, sharing the processor time. This removes the need for a software kernel. 



At any time, a concurrent process may be 

Active 

Inactive - 



Being executed. 

On a list waiting to be executed. 

Ready to input. 

- Ready to output. 

- Waiting until a specified time. 

The scheduler operates in such a way that inactive processes do not consume any processor time. It allocates 
a portion of the processor's time to each process in turn. Active processes waiting to be executed are held 
in two linked lists of process workspaces, one of high priority processes and one of low priority processes 
(page 269). Each list is implemented using two registers, one of which points to the first process in the list, 
the other to the last. In the Linked Process List figure 3.3, process S is executing and P, Oand R are active, 
awaiting execution. Only the low priority process queue registers are shown; the high priority process ones 
perform in a similar manner. 
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Figure 3.3 Linked process list 



Table 3.3 Priority queue control registers 



Function 


High Priority 


Low Priority 


Pointer to front of active process list 
Pointer to back of active process list 


FptrO 
BptrO 


Fptn 
Bptn 



Each process runs until it has completed its action, but is descheduled whilst waiting for communication from 
another process or transputer, or for a time delay to complete. In order for several processes to operate in 
parallel, a low priority process is only permitted to run for a maximum of two time slices before it Is forcibly 
descheduled at the next descheduling point (page 273). The time slice period is 5120 cycles of the external 
5 MHz clock, giving ticks approximately 1 ms apart. 
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A process can only be descheduled on certain instructions, known as descheduling points (page 273). As a 
result, an expression evaluation can be guaranteed to execute without the process being timesliced part way 
through. 

Whenever a process is unable to proceed, its instruction pointer is saved in the process workspace and 
the next process taken fronn the list. Process scheduling pointers are updated by instructions which cause 
scheduling operations, and should not be altered directly. Actual process switch times are less than 1 /xs, as 
little state needs to be saved and it is not necessary to save the evaluation stack on rescheduling. 

The processor provides a number of special operations to support the process model, including start process 
and end process. When a main process executes a parallel construct, start process instructions are used 
to create the necessary additional concurrent processes. A start process instruction creates a new process 
by adding a new workspace to the end of the scheduling list, enabling the new concurrent process to be 
executed together with the ones already being executed. When a process is made active it is always added 
to the end of the list, and thus cannot pre-empt processes already on the same list. 

The correct termination of a parallel construct is assured by use of the end process instruction. This uses 
a workspace location as a counter of the parallel construct components which have still to terminate. The 
counter is initialised to the number of components before the processes are started. Each component ends 
with an end process instruction which decrements and tests the counter. For all but the last component, the 
counter is non zero and the component is descheduled. For the last component, the counter is zero and the 
main process continues. 

3.4 Priority 

The IMS T425 supports two levels of priority. Priority 1 (low priority) processes are executed whenever there 
are no active priority (high priority) processes. 

High priority processes are expected to execute for a short time. If one or more high priority processes are 
able to proceed, then one is selected and runs until it has to wait for a communication, a timer input, or until 
it completes processing. 

If no process at high priority is able to proceed, but one or more processes at low priority are able to proceed, 
then one is selected. 

Low priority processes are periodically timesliced to provide an even distribution of processor time between 
computationally intensive tasks. 

If there are n low priority processes, then the maximum latency from the time at which a low priority process 
becomes active to the time when it starts processing is 2n-2 timeslice periods. It is then able to execute for 
between one and two timeslice periods, less any time taken by high priority processes. This assumes that 
no process monopolises the transputer's time; i.e. it has a distribution of descheduling points (page 273). 

Each timeslice period lasts for 5120 cycles of the external 5 MHz input clock (approximately 1 ms at the 
standard frequency of 5 MHz). 

If a high priority process is waiting for an external channel to become ready, and if no other high priority 
process is active, then the interrupt latency (from when the channel becomes ready to when the process 
starts executing) is typically 19 processor cycles, a maximum of 58 cycles (assuming use of on-chip RAM). 

3.5 Communications 

Communication between processes is achieved by means of channels. Process communication is point-to- 
point, synchronised and unbuffered. As a result, a channel needs no process queue, no message queue and 
no message buffer. 

A channel between two processes executing on the same transputer is implemented by a single word in 
memory; a channel between processes executing on different transputers is implemented by point-to-point 
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links. The processor provides a number of operations to support message passing, the most important being 
input message and output message. 

The input message and output message instructions use the address of the channel to determine whether 
the channel is internal or external. Thus the same instruction sequence can be used for both, allowing a 
process to be written and compiled without knowledge of where its channels are connected. 

The process which first becomes ready must wait until the second one is also ready. A process performs an 
input or output by loading the evaluation stack with a pointer to a message, the address of a chanhel, and 
a count of the number of bytes to be transferred, and then executing an input message or output message 
instruction. Data is transferred if the other process is ready. If the channel is not ready or is an external one 
the process will deschedule. 

3.6 Block move 

The block move on the transputer moves any number of bytes from any byte boundary in memory, to any 
other byte boundary, using the smallest possible number of word read, and word or part-word writes. 

A block movp instruction can be interrupted by a high priority process. On interrupt, block move is completed 
to a word boundary, independent of start position. When restarting after interrupt, the last word written is 
written again. This appears as an unnecessary read and write in the simplest case of word aligned block 
moves, and may cause problems with FIFOs. This problem can be overcome by incrementing the saved 
destination (BreglntSaveLoc) and source pointer (CreglntSaveLoc) values by BytesPerWord during the high 
priority process. 

3.7 Timers 

The transputer has two 32 bit timer clocks which 'tick' periodically. The timers provide accurate process 
timing, allowing processes to deschedule themselves until a specific time. 

One timer is accessible only to high priority processes and is incremented every microsecond, cycling com- 
pletely in approximately 4295 seconds. The other is accessible only to low priority processes and is incre- 
mented every 64 microseconds, giving exactly 15625 ticks in one second. It has a full period of approximately 
76 hours. 

Table 3.4 Timer registers 



ClockO 
Clocf<1 
TNextRegO 
TNextRegl 



Current value of high priority (level 0) process clock 

Current value of low priority (level 1) process clock 

Indicates time of earliest event on high priority (level 0) timer queue 

Indicates time of earliest event on low priority (level 1) timer queue 



The current value of the processor clock can be read by executing a load timer instruction. A process can 
arrange to perform a timer input, in which case it will become ready to execute after a specified time has 
been reached. The //mer /npiyf instruction requires a time to be specified. If this time is in the 'past' then the 
instruction has no effect. If the time is in the 'future' then the process is descheduled. When the specified 
time is reached the process is scheduled again. 
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Figure 3.4 shows two processes waiting on the timer queue, one waiting for time 21 , the other for time 31 . 
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Figure 3.4 Timer registers 
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4 Instruction set summary 

The Function Codes table 4.7. gives the basic function code set (page 266). Where the operand is less 
than 16, a single byte encodes the complete instruction. If the operand is greater than 15, one prefix 
instruction (pfix) is required for each additional four bits of the operand. If the operand is negative the first 
prefix instruction will be nfix. 

Table 4.1 pref/x coding 









Function 


Memory 


Mnemonic 


code 


code 


Idc 




#3 


#4 


#43 


Idc 




#35 






is coded 


as 








pfix 




#3 


#2 


#23 


Idc 




#5 


#4 


#45 


Idc 




#987 






is coded 


as 








pfix 




#9 


#2 


#29 


pfix 




#8 


#2 


#28 


Idc 




#7 


#4 


#47 


Idc 




-31 (Idc 


#FFFFFFE1) 




is coded 


as 








nfix 




#1 


#6 


#61 


Idc 




#1 


#4 


#41 



Tables 4.8 to 4.21 give details of the operation codes. Where an operation code is less than 16 (e.g. add: 
operation code 05), the operation can be stored as a single byte comprising the operate function code F and 
the operand (5 in the example). Where an operation code is greater than 15 (e.g. ladd: operation code 16), 
the prefix function code 2 is used to extend the instruction. 



Table 4.2 operate coding 



Mnemonic 



Function 
code 



Memory 
code 



add 
is coded as 

opr 

ladd 
is coded as 

pfix 
opr 



(op. code #5) 

add 

(op. code #1 6) 

#1 
#6 



#F 



#2 
#F 



#F5 

#F5 

#21 F6 

#21 
#F6 



The load device identity {Iddevid} instruction (table 4.20) pushes the device type identity into the A register. 
Each product is allocated a unique group of numbers for use with the /c/cyeWo^ instruction. The product identity 
numbers for the IMS T425 are to 9 inclusive. 



The Processor Cycles column refers to the number of periods TPCLPCL taken by an instruction executing 
in internal memory. The number of cycles is given for the basic operation only; where the memory code 
for an instruction is two bytes, the time for the pirefix function (one cycle) should be added. For a 20 MHz 
transputer one cycle is 50 ns. Some instruction times vary. Where a letter is included in the cycles column it 
is interpreted from table 4.3. 
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Table 4.3 Instruction set interpretation 



Ident 



Interpretation 



b 
m 

n 
w 



P 
r 



Bit number of the highest bit set in register A. Bit is the least significant bit. 

Bit number of the highest bit set in the absolute value of register A. 
Bit is the least significant bit. 

Number of places shifted. 

Number of words in the message. Part words are counted as full words. If the message 
is not word aligned the number of words is increased to include the part words at either 
end of the message. 

Number of words per row. 

Number of rows. 



The DE column of the tables indicates the descheduling/error features of an instruction as described in 
table 4.4. 

Table 4.4 Instruction features 



Ident 


Feature 


See page: 


D 

E 


The instruction is a descheduling point 
The instruction will affect the Error flag 


273 
274, 285 



4.1 



Descheduling points 



The instructions In table 4.5 are the only ones at which a process may be descheduled (page 268). They are 
also the ones at which the processor will halt if the Analyse pin is asserted (page 284). 



Table 4.5 Descheduling point instructions 



input message 
timer ait wait 
jump 



output message 
timer input 
loop end 



output byte 
stop on error 
end process 



output word 
alt wait 
stop process 
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4.2 Error instructions 

The instructions in table 4.6 are the only ones which can affect the Error flag (page 285) directly. 

Table 4.6 Error setting instructions 



add 


add constant 


subtract 




multiply 


fractional multiply 


divide 


remainder 


long add 


long subtract 


long divide 




set error 


testerr 




cflerr 


check word 


check subscript from 


check single 


check count from 1 



4.3 



Debugging support 



Table 4.21 contains a number of instructions to facilitate the implementation of breakpoints. These instructions 
overload the operation of yO. Normally yO is a no-op which might cause descheduling. SetjObreak enables the 
breakpointing facilities and causes yO to act as a breakpointing instruction. When breakpointing is enabled, 
yO swaps the current Iptr and Wptr with an Iptr and Wptr stored above MemStart. The breakpoint instruction 
does not cause descheduling, and preserves the state of the registers. It is possible to single step the pro- 
cessor at machine level using these instructions. Refer to Support for debugging/breakpointing in transputers 
(technical note 61) for more detailed information regarding debugger support. 
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Table 4.7 IMS T425 function codes 



Function 


Memory 




Processor 




D 


Code 


Code 


IVInemonic 


Cycles 


Name 


E 





OX 


J 


3 


jump 


D 


1 


1X 


Idip 


1 


load local pointer 




2 


2X 


pfix 


1 


prefix 




3 


3X 


Idnl 


2 


load non-local 




4 


4X 


Idc 


1 


load constant 




5 


5X 


Idnip 


1 


load non-local pointer 




6 


6X 


nfix 


1 


negative prefix 




7 


7X 


Idl 


2 


load local 




8 


8X 


adc 


1 


add constant 


E 


9 


9X 


call 


7 


call 




A 


AX 


cj 


2 

4 


conditional jump (not taken) 
conditional jump (taken) 




B 


BX 


ajw 


1 


adjust workspace 




C 


OX 


eqc 


2 


equals constant 




D 


DX 


sti 


1 


store local 




E 


EX 


stnl 


2 


store non-local 




F 


FX 


opr 


- 


operate 





Table 4.8 IMS T425 arithmetic/logical operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


46 


24F6 


and 


1 


and 




4B 


24FB 


or 


1 


or 




33 


23F3 


xor 


1 


exclusive or 




32 


23F2 


not 


1 


bitwise not 




41 


24F1 


shI 


n-h2 


shift left 




40 


24F0 


shr 


n-h2 


shift right 




05 


F5 


add 


1 


add 


E 


OC 


FC 


sub 


1 


subtract 


E 


53 


25F3 


mul 


38 


multiply 


E 


72 


27F2 


fmul 


35 


fractional multiply (no rounding) 


E 








40 


fractional multiply (rounding) 


E 


2C 


22FC 


div 


39 


divide 


E 


1F 


21 FF 


rem 


37 


remainder 


E 


09 


F9 


gt 


2 


greater than 




04 


F4 


diff 


1 


difference 




52 


25F2 


sum 


1 


sum 




08 


F8 


prod 


b+A- 
m+5 


product for positive register A 
product for negative register A 
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Table 4.9 IMS T425 long arithmetic operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


16 


21 F6 


ladd 


2 


long add 


E 


38 


23F8 


Isub 


2 


long subtract 


E 


37 


23F7 


Isum 


3 


long sum 




4F 


24FF 


Idiff 


3 


long diff 




31 


23F1 


Imul 


33 


long multiply 




1A 


21 FA 


Idiv 


35 


long divide 


E 


36 


23F6 


IshI 


n+3 
n-28 


long shift left (n<32) 
long shift left(n>32) 




35 


23F5 


Ishr 


n+3 
n-28 


long shift right (n<32) 
long shift right (n>32) 




19 


21 F9 


norm 


n+5 
n-26 

3 


normalise (n<32) 
normalise (n>32) 
normalise (n=64) 





Table 4.10 IMS T425 general operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


00 


FO 


rev 


1 


reverse 




3A 


23FA 


xword 


4 


extend to word 




56 


25F6 


cword 


5 


check word 


E 


ID 


21 FD 


xdble 


2 


extend to double 




40 


24FC 


csngi 


3 


check single 


E 


42 


24F2 


mint 


1 


minimum integer 




5A 


25FA 


dup 


1 


duplicate top of stack 




79 


27F9 


pop 


1 


pop processor stack 





Table 4.1 1 IMS T425 floating point support operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


73 
90 
63 
6D 
60 

71 


27F3 
29F0 
26F3 
26FD 
26F0 

27F1 


cflerr 

fptesterr 

unpacksn 

roundsn 

postnormsn 

Idinf 


3 

1 

15 
12/15 
5/30 

1 


check floating point error 
load value true (FPU not present) 
unpack single length fp number 
round single length fp number 
post-normalise correction of 
single length fp number 

load single length infinity 


E 



Processor cycles are shown as Typical/Maximum cycles. 
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Table 4.12 IMS T425 2D block move operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


5B 


25FB 


move2dinit 


8 


initialise data for 2D block move 




50 


25FC 


move2dall 


(2p+23)*r 


2D block copy 




5D 


25 FD 


move2dnonzero 


(2p+23)*r 


2D block copy non-zero bytes 




5E 


25FE 


move2dzero 


(2p+23)*r 


2D block copy zero bytes 





Table 4.13 IMS T425 ORG and bit operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


74 
75 

76 
77 
78 


27F4 
27F5 

27F6 
27F7 
27F8 


crcword 
crcbyte 

bitcnt 

bitrevword 

bitrevnbits 


35 

11 

b+2 

36 

n+4 


calculate crc on word 
calculate crc on byte 

count bits set in word 
reverse bits in word 
reverse bottom n bits in word 





Table 4.14 IMS T425 indexing/array operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


02 


F2 


bsub 


1 


byte subscript 




OA 


FA 


wsub 


2 


word subscript 




81 


28F1 


wsubdb 


3 


form double word subscript 




34 


23F4 


bent 


2 


byte count 




3F 


23FF 


went 


5 


word count 




01 


F1 


lb 


5 


load byte 




3B 


23FB 


sb 


4 


store byte 




4A 


24FA 


move 


2w+8 


move message 





Table 4.15 IMS T425 timer handling operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 
E 


22 


22F2 


Idtimer 


2 


load timer 




2B 
4E 


22FB 
24FE 


tin 
talt 


30 
4 
4 


timer input (time future) 
timer input (time past) 
timer alt start 


D 
D 


51 
47 


25F1 
24F7 


taltwt 
enbt 


15 
48 
8 


timer alt wait (time past) 
timer alt wait (time future) 
enable timer 


D 
D 


2E 


22FE 


dist 


23 


disable timer 
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Table 4.16 IMS T425 input/output operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


07 
OB 
OF 
OE 


F7 
FB 
FF 
FE 


in 

out 

outword 
outbyte 


2W+19 
2W+19 

23 

23 


input message 
output message 
output word 
output byte 


D 
D 
D 
D 


43 
44 

45 


24F3 
24F4 

24F5 


alt 
altwt 

attend 


2 
5 

17 
4 


alt start 

alt wait (channel ready) 
alt wait (channel not ready) 
alt end 


D 
D 


49 
30 


24F9 
23F0 


enbs 
diss 


3 

4 


enable skip 
disable skip 




12 
48 

2F 


21 F2 
24F8 

22FF 


resetch 
enbc 

disc 


3 
7 
5 
8 


reset channel 
enable channel (ready) 
enable channel (not ready) 
disable channel 





Table 4.17 IMS T425 control operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


20 
IB 
3C 
06 
21 


22F0 
21 FB 
23FC 

F6 
22F1 


ret 

Idpi 

gajw 

gcall 

lend 


5 
2 
2 

4 
10 
5 


return 

load pointer to instruction 

general adjust workspace 

general call 

loop end (loop) 

loop end (exit) 


D 
D 



Table 4.18 IMS T425 scheduling operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


OD 


FD 


startp 


12 


start process 


D 


03 


F3 


endp 


13 


end process 


D 


39 


23F9 


runp 


10 


run process 




15 


21 F5 


stopp 


11 


stop process 




IE 


21 FE 


Idpri 


1 


load current priority 
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Table 4.19 IMS T425 error handling operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


13 


21 F3 


csubO 


2 


check subscript from 


E 


4D 


24FD 


ccnti 


3 


check count from 1 


E 


29 


22F9 


testerr 


2 
3 


test error false and clear (no error) 
test error false and clear (error) 




10 


21 FO 


seterr 


1 


set error 


E 


55 


25F5 


stoperr 


2 


stop on error (no error) 


D 


57 


25F7 


cirhalterr 


1 


clear halt-on-error 




58 


25F8 


sethalterr 


1 


set halt-on-error 




59 


25F9 


testhalterr 


2 


test halt-on-error 





Table 4.20 IMS T425 processor initialisation operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


2A 


22FA 


testpranal 


2 


test processor analysing 




3E 


23FE 


saveh 


4 


save high priority queue registers 




3D 


23FD 


savel 


4 


save low priority queue registers 




18 


21 F8 


sthf 




store high priority front pointer 




50 


25F0 


sthb 




store high priority back pointer 




10 


21 FO 


stif 




store low priority front pointer 




17 


21 F7 


stib 




store low priority back pointer 




54 


25F4 


sttimer 




store timer 




170 


2127F0 


Iddevid 




load device identity 




7E 


27FE 


Idmemstartval 




load value of memstart address 





Table 4.21 IMS T425 debugger support codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 





00 


jumpO 


3 

11 
13 


jump (break not enabled) 

jump (break enabled, high priority) 

jump (break enabled, low priority) 


D 


B1 


2BF1 


break 


9 

11 


break (high priority) 
break (low priority) 




B2 


2BF2 


cIrjObreak 


1 


clear jump break enable flag 




B3 


2BF3 


setjObreak 


1 


set jump break enable flag 




84 


2BF4 


testjObreak 


2 


test jump break enable flag set 




7A 


27FA 


timerdisableh 


1 


disable high priority timer interrupt 




7B 


27FB 


timerdisablel 


1 


disable low priority timer interrupt 




70 


27F0 


timerenableh 


6 


enable high priority timer interrupt 




7D 


27FD 


timerenablel 


6 


enable low priority timer interrupt 
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5 System services 

System services include all the necessary logic to initialise and sustain operation of the device. They also 
include error handling and analysis facilities. 



5.1 



Power 



Power is supplied to the device via the VCC and GND pins. Several of each are provided to minimise 
inductance within the package. All supply pins must be connected. The supply must be decoupled close to 
the chip by at least one 100 nF low inductance (e.g. ceramic) capacitor between VCC and GND. Four layer 
boards are recommended; if two layer boards are used, extra care should be taken in decoupling. 

Input voltages must not exceed specification with respect to VCC and GND, even during power-up and power- 
down ramping, othen^/ise latchup can occur. CMOS devices can be permanently damaged by excessive 
periods of latchup. 



5.2 



CapPlus, CapMinus 



The internally derived power supply for internal clocks requires an external low leakage, low inductance 1//F 
capacitor to be connected between CapPlus and CapMinus. A ceramic capacitor is preferred, with an 
impedance less than 3 Ohms between 100 KHz jnd 10 MHz. If a polarised capacitor is used the negative 
terminal should be connected to CapMinus. Total RGB track length should be less than 50 mm. The 
connections must not touch power supplies or other noise sources. 
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Figure 5.1 Recommended PLL decoupling 



5.3 



Clockin 



Transputer family components use a standard clock frequency, supplied by the user on the Clockin input. 
The nominal frequency of this clock for all transputer family components is 5 MHz, regardless of device type, 
transputer word length or processor cycle time. High frequency internal clocks are derived from Clockin, 
simplifying system design and avoiding problems of distributing high speed clocks externally. 

A number of transputer devices may be connected to a common clock, or may have individual clocks providing 
each one meets the specified stability criteria. In a multi-clock system the relative phasing of Clockin clocks 
is not important, due to the asynchronous nature of the links. Mark/space ratio is unimportant provided the 
specified limits of Clockin pulse widths are met. 

Oscillator stability is important. Clockin must be derived from a crystal oscillator; RG oscillators are not 
sufficiently stable. Clockin must not be distributed through a long chain of buffers. Clock edges must be 
monotonic and remain within the specified voltage and time limits. 
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Table 5.1 Input clock 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TDCLDCH 


Clockln pulse width low 


40 






ns 


1 


TDCHDCL 


Clockln pulse width high 


40 






ns 


1 


TDCLDCL 


Clockln period 




200 




ns 


1,2.4 


TDCerror 


Clockln timing error 






±0.5 


ns 


1.3 


TDC1 DC2 


Difference in Clockln for 2 linked devices 






400 


ppm 


1,4 


TDCr 


Clockln rise time 






10 


ns 


1.5 


TDCf 


Clockln fall time 






8 


ns 


1.5 



Notes 



1 These paramters are not tested. 

2 Measured between corresponding points on consecutive falling edges. 

3 Variation of individual falling edges from their nominal times. 

4 This value allows the use of 200ppm crystal oscillators for two devices connected together by a link. 

5 Clock transitions must be monotonic within the range VIH to VIL (table 10.3). 
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Figure 5.2 Clockln timing 



5.4 



ProcSpeedSelectO-2 



Processor speed of the IMS T425 is variable in discrete steps. The desired speed can be selected, up to the 
maximum rated for a particular component, by the three speed select lines ProcSpeedSelectO-2. The pins 
are tied high or low, according to the table below, for the various speeds. The pins are arranged so that the 
IMS T425 can be plugged directly into a board designed for a IMS T800. 

Only six of the possible speed select combinations are currently used; the other two are not valid speed 
selectors. The frequency of Clockln for the speeds given in the table is 5 MHz. 
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Table 5.2 Processor speed selection 



Proc 
Speed 
Select2 


Proc 
Speed 
Selecti 


Proc 
Speed 
SelectO 


Processor 

Clock 
Speed MHz 


Processor 

Cycle 
Time ns 


Notes 











20.0 


50.0 










1 


22.5 


44.4 







1 





25.0 


40.0 







1 


1 


30.0 


33.3 




1 








35.0 


28.6 




1 





1 






Invalid 


1 


1 





17.5 


57.1 




1 


1 


1 






Invalid 



Note: Inclusion of a speed selection in this table does not imply immediate availability. 



5.5 



Reset 



Reset can go high with VCC, but must at no time exceed the maximum specified voltage for VIH. After VCC is 
valid Clockln should be running for a minimum period TDCVRL before the end of Reset. The falling edge of 
Reset initialises the transputer, triggers the memory configuration sequence and starts the bootstrap routine. 
Link outputs are forced low during reset; link inputs and EventReq should be held low. Memory request 
(DMA) must not occur whilst Reset is high but can occur before bootstrap (page 308). 

After the end of Reset there will be a delay of 144 periods of Clockln (figure 5.3). Following this, the 
MemWrDO, MemRfDI and MemAD2-31 pins will be scanned to check for the existence of a pre-programmed 
memory interface configuration (page 297). This lasts for a further 144 periods of Clockln. Regardless of 
whether a configuration was found, 36 configuration read cycles will then be performed on external memory 
using the default memory configuration (page 299), in an attempt to access the external configuration ROM. 
A delay will then occur, its period depending on the actual configuration. Finally eight complete and con- 
secutive refresh cycles will initialise any dynamic RAM, using the new memory configuration. If the memory 
configuration does not enable refresh of dynamic RAM the refresh cycles will be replaced by an equivalent 
delay with no external memory activity. 

If BootFromRom is high bootstrapping will then take place immediately, using data from external memory; 
otherwise the transputer will await an input from any link. The processor will be in the low priority state. 



Reset 

Action 



Delay 



Internal 



External 



configuration configuration 



Delay 



Refresh 



Boot 



Figure 5.3 IMS T425 post-reset sequence 



5.6 



Bootstrap 



The transputer can be bootstrapped either from a link or from external ROM. To facilitate debugging, Boot- 
FromRom may be dynamically changed but must obey the specified timing restrictions. It is sampled once 
only by the transputer, before the first instruction is executed after Reset is taken low. 

If BootFromRom is connected high (e.g. to VCC) the transputer starts to execute code from the top two bytes 
in external memory, at address #7FFFFFFE. This location should contain a backward jump to a program in 
ROM. Following this access, BootFromRom may be taken low if required. The processor is in the low priority 
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State, and the IV register points to MemStart (page 286). 



Table 5.3 Reset and Analyse 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPVRH 


Power valid before Reset 


10 






ms 




TRHRL 

TDCVRL 

TAHRH 

TRLAL 

TBRVRL 

TRLBRX 


Reset pulse width high 
Clockln running before Reset end 
Analyse setup before Reset 
Analyse hold after Reset end 
BootFromRom setup 
BootFromRom hold after Reset 


8 

10 

3 

1 










Clockln 

ms 

ms 
Clockln 

ms 

ms 


1 
2 

1 

3 


TALBRX 


BootFromRom hold after Analyse 










3 



Notes 



1 Full periods of Clockln TDCLDCL required. 

2 At power-on reset. 

3 Must be stable until after end of bootstrap period. See Bootstrap section. 
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Figure 5.4 Transputer reset timing with Analyse low 
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Figure 5.5 Transputer reset and analyse timing 
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If BootFromRom is connected low (e.g. to GND) the transputer will wait for the first bootstrap message to 
arrive on any one of its links. The transputer is ready to receive the first byte on a link within two processor 
cycles TPCLPCL after Reset goes low. 

If the first byte received (the control byte) is greater than 1 it is taken as the quantity of bytes to be input. The 
following bytes, to that quantity, are then placed in internal memory starting at location MemStart. Following 
reception of the last byte the transputer will start executing code at MemStart as a low priority process. 
BootFromRom may be taken high after reception of the last byte, if required. The memory space immediately 
above the loaded code is used as work space. Messages arriving on other links after the control byte has 
been received and on the bootstrapping link after the last bootstrap byte will be retained until a process inputs 
from them. 

5.7 Peek and poke 

Any location in Internal or external memory can be interrogated and altered when the transputer is waiting 
for a bootstrap from link. If the control byte is then eight more bytes are expected on the same link. The 
first four byte word is taken as an internal or external memory address at which to poke (write) the second 
four byte word. If the control byte is 1 the next four bytes are used as the address from which to peek (read) 
a word of data; the word is sent down the output channel of the same link. 

Following such a peek or poke, the transputer returns to its previously held state. Any number of accesses 
may be made in this way until the control byte is greater than 1 , when the transputer will commence reading 
its bootstrap program. Any link can be used, but addresses and data must be transmitted via the same link 
as the control byte. 

5.8 Analyse 

If Analyse is taken high when the transputer is running, the transputer will halt at the next descheduling point 
(page 273). From Analyse being asserted, the processor will halt within three time slice periods plus the 
time taken for any high priority process to complete. As much of the transputer status is maintained as is 
necessary to permit analysis of the halted machine. Processor flags Error, HaltOnError and EnableJOBreak 
are normally cleared at reset on the IMS T425; however, if Analyse is asserted the flags are not altered. 
Memory refresh continues. 

Input links will continue with outstanding transfers. Output links will not make another access to memory 
for data but will transmit only those bytes already in the link buffer. Providing there is no delay in link 
acknowledgement, the links should be inactive within a few microseconds of the transputer halting. 

Reset should not be asserted before the transputer has halted and link transfers have ceased. When Reset 
is taken low whilst Analyse is high, neither the memory configuration sequence nor the block of eight refresh 
cycles will occur; the previous memory configuration will be used for any external memory accesses. If 
BootFromRom is high the transputer will bootstrap as soon as Analyse is taken low, otherwise it will await a 
control byte on any link. If Analyse is taken low without Reset going high the transputer state and operation 
are undefined. After the end of a valid Analyse sequence the registers have the values given in table 5.4. 

Table 5.4 Register values after Analyse 



/ MemStart if bootstrapping from a link, or the external memory bootstrap address if 
bootstrapping from ROM. 

W MemStart if bootstrapping from ROM, or the address of the first free word after the 
bootstrap program if bootstrapping from link. 

A The value of / when the processor halted. 

B The value of IV when the processor halted, together with the priority of the process 
when the transputer was halted (i.e. the IV descriptor). 

C The ID of the bootstrapping link if bootstrapping from link. 
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5.9 



Error, Errorin 



The Error pin carries the OR'ed output of the internal Error flag and the Errorin input. If Error is high it 
indicates either that Errorin is high or that an error was detected in one of the processes. An internal error 
can be caused, for example, by arithmetic overflow, divide by zero, array bounds violation or software setting 
the flag directly (page 274). Once set, the Error flag is only cleared by executing the instruction testerr. The 
error is not cleared by processor reset, in order that analysis can identify any errant transputer (page 284). 

A process can be programmed to stop if the Error flag is set; it cannot then transmit erroneous data to other 
processes, but processes which do not require that data can still be scheduled. Eventually all processes 
which rely, directly or indirectly, on data from the process in error will stop through lack of data. Errorin does 
not directly affect the status of a processor in any way. 

By setting the HaltOn Error ^ag the transputer itself can be programmed to halt if Error becomes set. If Error 
becomes set after HaltOnErrorhas been set, all processes on that transputer will cease but will not necessarily 
cause other transputers in a network to halt. Setting HaltOn Error after Error will not cause the transputer to 
halt; this allows the processor reset and analyse facilities to function with the flags in indeterminate states. 

An alternative method of error handling Is to have the errant process or transputer cause all transputers 
to halt. This can be done by 'daisy-chaining' the Errorin and Error pins of a number of processors and 
applying the final Error output signal to the EventReq pin of a suitably programmed master transputer. Since 
the process state is preserved when stopped by an error, the master transputer can then use the analyse 
function to debug the fault. When using such a circuit, note that the Error flag is in an indeterminate state on 
power up; the circuit and software should be designed with this in mind. 

Error checks can be removed completely to optimise the performance of a proven program; any unexpected 
error then occurring will have an arbitrary undefined effect. 

If a high priority process pre-empts a low priority one, status of the Error and HaltOn Error flags Is saved for 
the duration of the high priority process and restored at the conclusion of it. Status of both flags is transmitted 
to the high priority process. Either flag can be altered in the process without upsetting the error status of any 
complex operation being carried out by the pre-empted low priority process. 

In the event of a transputer halting because of HaltOnError, the links will finish outstanding transfers before 
shutting down. If Analyse is asserted then all inputs continue but outputs will not make another access to 
memory for data. Memory refresh will continue to take place. 

After halting due to the Error flag changing from to 1 whilst HaltOnError \s set, register /points two bytes 
past the instruction which set Error. After halting due to the Analyse pin being taken high, register / points 
one byte past the instruction being executed. In both cases /will be copied to register A. 
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Figure 5.6 Error handling in a multi-transputer system 
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6 Memory 

The IMS T425 has 4 Kbytes of fast internal static memory for high rates of data throughput. Each internal 
memory access takes one processor cycle ProcCIockOut (page 288). The transputer can also access 
4 Gbytes of external memory space. Internal and external memory are part of the same linear address 
space. Internal RAM can be disabled by holding DisablelntRAM high. All internal addresses are then 
mapped to external RAM. This pin should not be altered after Reset has been taken low. 

IMS T425 memory is byte addressed, with words aligned on four-byte boundaries. The least significant byte 
of a word is the lowest addressed byte. 

The bits in a byte are numbered to 7, with bit the least significant. The bytes are numbered from 0, with 
byte the least significant. In general, wherever a value is treated as a number of component values, the 
components are numbered in order of increasing numerical significance, with the least significant component 
numbered 0. Where values are stored in memory, the least significant component value is stored at the 
lowest (most negative) address. 

Internal memory starts at the most negative address #80000000 and extends to #80000FFF. User memory 
begins at #80000070; this location is given the name MemStart. An instruction Idmemstartval is provided to 
obtain the value of MemStart. 

The context of a process in the transputer model involves a workspace descriptor (WPtr) and an instruction 
pointer (IRr). WPtr is a word address pointer to a workspace in memory. IPtr points to the next instruction to 
be executed for the process which is the currently executing process. The context switch performed by the 
breakpoint instruction swaps the WPtr and IPtr of the currently executing process with the WPtr and IPtr held 
above MemStart. Two contexts are held above MemStart, one for high priority and one for low priority; this 
allows processes at both levels to have breakpoints. Note that on bootstrapping from a link, these contexts 
are overwritten by the loaded code. If this is not acceptable, the values should be peeked from memory 
before bootstrapping from a link. 

The reserved area of Internal memory below MemStart is used to implement link and event channels. 

Two words of memory are reserved for timer use, TPtrLocO for high priority processes and TPtrLod for low 
priority processes. They either indicate the relevant priority timer is not in use or point to the first process on 
the timer queue at that priority level. 

Values of certain processor registers for the current low priority process are saved in the reserved IntSaveLoc 
locations when a high priority process pre-empts a low priority one. Other locations are reserved for extended 
features such as block moves. 

External memory space starts at #80001000 and extends up through #00000000 to #7FFFFFFF. Memory 
configuration data and ROM bootstrapping code must be in the most positive address space, starting at 
#7FFFFF6C and #7FFFFFFE respectively. Address space immediately below this is conventionally used for 
ROM based code. 
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Figure 6.1 IMS T425 memory map 



1 These locations are used as auxiliary processor registers and should not be manipulated by the user. Like 
processor registers, their contents may be useful for implementing debugging tools (Analyse, page 284). For 
details see Transputer Instruction Set - A Compiler Writers' Guide. 
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7 External memory Interface 

The External Memory Interface (EMI) allows access to a 32 bit address space, supporting dynamic and static 
RAM as well as ROM and EPROM. EMI timing can be configured at Reset to cater for most memory types 
and speeds, and a program is supplied with the Transputer Development System to aid in this configuration. 

There are 17 internal configurations which can be selected by a single pin connection (page 297). If none 
are suitable the user can configure the interface to specific requirements, as shown in page 299. 

7.1 ProcClockOut 

This clock is derived from the internal processor clock, which is in turn derived from Clockln. Its period is 
equal to one internal microcode cycle time, and can be derived from the formula 

TPCLPCL = TDCLDCL / PLLx 

where TPCLPCL Is the ProcClockOut Period, TDCLDCL Is the Clockln Period and PLLx is the phase 
lock loop factor for the relevant speed part, obtained from the ordering details (Ordering section). 

The time value Tm is used to define the duration of Tstates and, hence, the length of external memory cycles; 
its value is exactly half the period of one ProcClockOut cycle (0.5*TPCLPCL), regardless of mark/space 
ratio of ProcClockOut. 

Edges of the various external memory strobes coincide with rising or falling edges of ProcClockOut. It should 
be noted, however, that there is a skew associated with each coincidence. The value of skew depends on 
whether coincidence occurs when the ProcClockOut edge and strobe edge are both rising, when both are 
falling or if either is rising when the other is falling. Timing values given in the strobe tables show the best 
and worst cases. If a more accurate timing relationship is required, the exact Tstate timing and strobe edge 
to ProcClockOut relationships should be calculated and the correct skew factors applied from the edge skew 
timing table 7.4. 

The timing parameters in the following tables are based on 1 7 MHz and 20 MHz parts. Data for higher speeds 
is based on tests on a limited number of samples and may change when full characterisation is completed. 

7.2 Tstates 

The external memory cycle is divided into six Tstates with the following functions: 
T1 Address setup time before address valid strobe. 
T2 Address hold time after address valid strobe. 
T3 Read cycle tristate or write cycle data setup. 
T4 Extendable data setup time. 
T5 Read or write data. 
T6 Data hold. 

Under normal conditions each Tstate may be from one to four periods Tm long, the duration being set during 
memory configuration. The default condition on Reset is that all Tstates are the maximum four periods Tm 
long to allow external initialisation cycles to read slow ROM. 

Period T4 can be extended indefinitely by adding externally generated wait states. 

An external memory cycle is always an even number of periods Tm in length and the start of T1 always 
coincides with a rising edge of ProcClockOut. If the total configured quantity of periods Tm is an odd 
number, one extra period Tm will be added at the end of T6 to force the start of the next T1 to coincide with 
a rising edge of ProcClockOut. This period is designated E in configuration diagrams (figure 7.11). 



7 External memory interface 



289 



Table 7.1 ProcClockOut 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPCLPCL 


ProcClockOut period 


a-2 


a 


a+2 


ns 


1.5 


TPCHPCL 


ProcClockOut pulse width high 


b-1 1 .5 


b 


b+3.5 


ns 


2,5 


TPCLPCH 


ProcClockOut pulse width low 




c 




ns 


3,5 


Tm 


ProcClockOut half cycle 


b-1 


b 


b+1 


ns 


2,5 


TPCstab 


ProcClockOut stability 






8 


% 


4,5 



Notes 



1 a is TDCLDCL/PLLx. 

2 b is 0.5*TPCLPCL (half the processor clock period). 

3 c is TPCLPCL-TPCHPCL. 

4 Stability is the variation of cycle periods between two consecutive cycles, measured at corresponding points on 
the cycles. 

5 This parameter is sampled and not 100% tested. 
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Figure 7.1 IMS T425 ProcClockOut timing 



7.3 



Internal access 



During an internal memory access cycle the external memory interface bus MemAD2-31 reflects the word 
address used to access internal RAM, MemnotWrDO reflects the read/write operation and MemnotRfDI is 
high; all control strobes are inactive. This is true unless and until a memory refresh cycle or DMA (memory 
request) activity takes place, when the bus will carry the appropriate external address or data. 

The bus activity is not adequate to trace the internal operation of the transputer in full, but may be used for 
hardware debugging in conjuction with peek and poke (page 284). 
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Figure 7.2 IMS T425 bus activity for internal memory cycle 
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7.4 MemAD2-31 

External memory addresses and data are multiplexed on one bus. Only the top 30 bits of address are 
output on the external memory interface, using pins MemAD2-31. They are normally output only during 
Tstates T1 and T2, and should be latched during this time. Byte addressing is carried out internally by the 
transputer for read cycles. For write cycles the relevant bytes in memory are addressed by the write strobes 
notMemWrBO-3. 

The data bus is 32 bits wide. It uses MemAD2-31 for the top 30 bits and MemnotRfDI and MemnotWrDO 
for the lower two bits. Read cycle data may be set up on the bus at any time after the start of 13, but must 
be valid when the transputer reads it at the end of T5. Data may be removed any time during 16, but must 
be off the bus no later than the end of that period. 

Write data is placed on the bus at the start of T3 and removed at the end of T6. If T6 is extended to force 
the next cycle Tmx (page 290) to start on a rising edge of ProcClockOut, data will be valid during this time 
also. 

7.5 MemnotWrDO 

During 11 and T2 this pin will be low if the cycle is a write cycle, othenwise it will be high. During Tstates T3 
to T6 It becomes bit of the data bus. In both cases it follows the general timing of MemAD2-31. 

7.6 MemnotRfDI 

During T1 and T2, this pin is low if the address on MemAD2-31 is a refresh address, otherwise it is high. 
During Tstates T3 to T6 it becomes bit 1 of the data bus. In both cases it follows the general timing of 
MemAD2-31. 

7.7 notMemRd 

For a read cycle the read strobe notMemRd is low during T4 and T5. Data is read by the transputer on the 
rising edge of this strobe, and may be removed immediately afterward. If the strobe duration is insufficient it 
may be extended by adding extra periods Tm to either or both of the Tstates T4 and T5. Further extension 
may be obtained by inserting wait states at the end of T4. 

In the read cycle timing diagrams ProcClockOut is included as a guide only; it is shown with each Tstate 
configured to one period Tm. 

7.8 notMemSO-4 

To facilitate control of different types of memory and devices, the EMI is provided with five strobe outputs, 
four of which can be configured by the user. The strobes are conventionally assigned the functions shown in 
the read and write cycle diagrams, although there is no compulsion to retain these designations. 

notMemSO is a fixed format strobe. Its leading edge is always coincident with the start of T2 and its trailing 
edge always coincident with the end of T5. 

The leading edge of notMemSI is always coincident with the start of T2, but its duration may be configured 
to be from zero to 31 periods Tm. Regardless of the configured duration, the strobe will terminate no later 
than the end of T6. The strobe Is sometimes programmed to extend beyond the normal end of Tmx. When 
wait states are Inserted into an EMI cycle the end of Tmx is delayed, but the potential active duration of the 
strobe is not altered. Thus the strobe can be configured to terminate relatively early under certain conditions 
(page 306). If notMemSI is configured to be zero it will never go low. 
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notMemS2, notMemSS and notMemS4 are identical in operation. They all terminate at the end of T5, but 
the start of each can be delayed from one to 31 periods Tm beyond the start of T2. If the duration of one of 
these strobes would take it past the end of T5 it will stay high. This can be used to cause a strobe to become 
active only when wait states are inserted. If one of these strobes is configured to zero it will never go low. 
Figure 7.5 shows the effect of Wait on strobes in more detail; each division on the scale is one period Tm. 

Table 7.2 Read 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TaZdV 


Address tristate to data valid 









ns 




TdVRdH 


Data setup before read 


25 






ns 




TRdHdX 


Data hold after read 









ns 




TSOLRdL 


notK/lemSO before start of read 


a-4 


a 


8+4 


ns 


1 


TSOHRdH 


End of read from end of notMemSO 


-4 




4 


ns 




TRdLRdH 


Read period 


b-3 




b+5 


ns 


2 



Notes 



1 a is total of T2+T3 where T2, T3 can be from one to four periods Tm each in length. 

2 b is total of T4+Twait+T5 where T4, T5 can be from one to four periods Tm each in length and Twait may be 

any number of periods Tm in length. 
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Figure 7.3 IMS T425 external read cycle: static memory 
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Figure 7.4 IMS T425 external read cycle: dynamic memory 
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Table 7.3 IMS T425 strobe timing 



SYMBOL 


(") 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TaVSOL 




Address setup before notMemSO 


a-8 






ns 


1 


TSOLaX 




Address hold after notMemSO 


b-8 


b 


b+8 


ns 


2 


TSOLSOH 




notMemSO pulse width low 


c-5 




c+6 


ns 


3 


TS0LS1L 


1 


notMemSI from notMemSO 


-4 




4 


ns 




TS0LS1H 


5 


notMemSI end from notMemSO 


d-1 




d+9 


ns 


4,6 


TS0HS1H 


9 


notMemSI end from notMemSO end 


e-8 




e+4 


ns 


5.6 


TS0LS2L 


2 


notMemS2 delayed after notMemSO 


f-6 




f+5 


ns 


7 


TS0LS2H 


6 


notMemS2 end from notMemSO 


c-5 




c+7 


ns 


3 


TS0HS2H 


10 


notMemS2 end from notMemSO end 


-4 




7 


ns 




TS0LS3L 


3 


notMemS3 delayed after notMemSO 


f-6 




f+5 


ns 


7 


TS0LS3H 


7 


notMemS3 end from notMemSO 


C-5 




c+7 


ns 


3 


TS0HS3H 


11 


notMemS3 end from notMemSO end 


-4 




7 


ns 




TS0LS4L 


4 


notMemS4 delayed after notMemSO 


f-6 




f+5 


ns 


7 


TS0LS4H 


8 


notMemS4 end from notMemSO 


c-5 




C+7 


ns 


3 


TS0HS4H 


12 


notMemS4 end from notMemSO end 


-4 




7 


ns 




Tmx 




Complete external memory cycle 




9 






8 



Notes 



1 a is T1 where T1 can be from one to four periods Tm in length. 

2 b is T2 where T2 can be from one to four periods Tm in length. 

3 c is total of T2+T3+T4+Twait+T5 where T2, T3, T4, T5 can be from one to four periods Tm each in length and 
Twait may be any number of periods Tm in length. 

4 d can be from zero to 31 periods Tm in length. 

5 e can be from -27 to +4 periods Tm in length. 

6 If the configuration would cause the strobe to remain active past the end of T6 it will go high at the end of T6. 
If the strobe is configured to zero periods Tm it will remain high throughout the complete cycle Tmx. 

7 f can be from zero to 31 periods Tm in length. If this length would cause the strobe to remain active past the 
end of T5 it will go high at the end of T5. If the strobe value is zero periods Tm it will remain low throughout 
the complete cycle Tmx. 

8 g is one complete external memory cycle comprising the total of T1+T2+T3+T4+Twalt+T5+T6 where T1, T2, 
T3, T4, T5 can be from one to four periods Tm each in length, T6 can be from one to five periods Tm in length 
and Twalt may be zero or any number of periods Tm in length,. 



Tstate|Tl|T2|T3|T4|T5|T6|Tl| 
notMemSI | I 


Tstate|Tl|T2|T3|T4|w|w|T5|T6|Tl| 


notMemSI 1 1 




notMemS2 

No wait states 


notMemS2 1 1 


Wait states inserted 



Figure 7.5 IMS T425 effect of wait states on strobes 
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Table 7.4 Strobe SO to ProcClockOut skew 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPCHSOH 
TPCLSOH 
TPCHSOL 
TPCLSOL 


notMemSO rising from ProcClockOut rising 
notMemSO rising from ProcClockOut falling 
notMemSO falling from ProcClockOut rising 
notMemSO falling from ProcClockOut falling 


-6 
-5 
-8 
-5 




4 
10 
3 

7 


ns 
ns 
ns 
ns 





ProcClockOut 



NotMemSO 



i 



K TPCHSOH 



y- 



J_^T 



TPCHSOL 



rTPCLSOH 



r TPCLSOL 



Figure 7.6 IMS T425 skew of notMemSO to ProcClockOut 



7.9 



notMemWrBO-3 



Because the transputer uses word addressing, four write strobes are provided; one to write each byte of the 
word. If a particular byte is not to be written, then the corresponding data outputs are tristated. notMemWrBO 
addresses the least significant byte. 

The transputer has both early and late write cycle modes. For a late write cycle the relevant write strobes 
notMemWrBO-3 are low during T4 and T5; for an early write they are also low during T3. Data should be 
latched into memory on the rising edge of the strobes in both cases, although it is valid until the end of T6. 
If the strobe duration is insufficient, it may be extended at configuration time by adding extra periods Tm to 
either or both of Tstates T4 and T5 for both early and late modes. For an early cycle they may also be added 
to T3. Further extension may be obtained by inserting wait states at the end of T4. If the data hold time is 
insufficient, extra periods Tm may be added to T6 to extend it. 

Table 7.5 Write 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TdVWrH 


Data setup before write 


d-7 




d+10 


ns 


1,5 


TWrHdX 


Data hold after write 


a-10 




a+5 


ns 


1,2 


TSOLWrL 


notMemSO before start of early write 


b-5 




b+5 


ns 


1,3 




notMemSO before start of late write 


c-5 




c+5 


ns 


1,4 


TSOHWrH 


End of write from end of notMemSO 


-5 




4 


ns 


1 


TWrLWrH 


Early write pulse width 


d-4 




d+7 


ns 


1,5 




Late write pulse width 


e-4 




e+7 


ns 


1,6 



Notes 



1 Timing is for all write strobes notMemWrBO-3. 

2 a is T6 where T6 can be from one to five periods Tm in length. 

3 b is T2 where T2 can be from one to four periods Tm in length. 

4 c is total of T2+T3 where T2, T3 can be from one to four periods Tm each in length. 

5 d is total of T3+T4+Twalt+T5 where T3, T4, T5 can be from one to four periods Tm each in length and Twait 
may be zero or any number of periods Tm in length. 

6 e Is total of T4+Twalt+T5 where T4, T5 can be from one to four periods Tm each in length and Twait may be 
zero or any number of periods Tm in length. 
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Figure 7.7 IMS T425 external write cycle 

In the write cycle timing diagram ProcClockOut is included as a guide only; It Is shown with each Tstate 
configured to one period Tm. The strobe is inactive during internal memory cycles. 
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Figure 7.8 IMS T425 dynamic RAM application 
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7.10 MemConfig 

MemConfig is an input pin used to read configuration data when setting external memory interface (EMI) 
characteristics. It is read by the processor on two occasions after Reset goes low; first to check if one of the 
preset internal configurations is required, then to determine a possible external configuration. 

7.10.1 Internal configuration 

The internal configuration scan comprises 64 periods TDCLDCL of Clockln during the internal scan period 
of 144 Clockln periods. MemnotWrDO, MemnotRfDI and MemAD2-32 are all high at the beginning of the 
scan. Starting with MemnotWrDO, each of these lines goes low successively at intervals of two Clockln 
periods and stays low until the end of the scan. If one of these lines is connected to MemConfig the preset 
internal configuration mode associated with that line will be used as the EMI configuration. The default 
configuration is that defined in the table for MemAD31 ; connecting MemConfig to VCC will also produce 
this default configuration. Note that only 17 of the possible configurations are valid, all others remain at the 
default configuration. 

Table 7.6 IMS T425 internal configuration coding 





Duration of each Tstate 




Strobe 




Write 


Refresh 


Cycle 








periods Tm 








coefficient 




cycle 


interval 


time 




















Clockln 


Proc 


Pin 


11 


12 


T3 14 


T5 


T6 


Si 


s2 S3 


s4 


type 


cycles 


cycles 


MemnotWrDO 


1 


1 


1 1 


1 


1 


30 


1 3 


5 


late 


72 


3 


MemnotRfDI 


1 


2 


1 1 


1 


2 


30 


1 2 


7 


late 


72 


4 


MemAD2 


1 


2 


1 1 


2 


3 


30 


1 2 


7 


late 


72 


5 


MemAD3 


2 


3 


1 1 


2 


3 


30 


1 3 


8 


late 


72 


6 


MemAD4 


1 


1 


1 1 


1 


1 


3 


1 2 


3 


early 


72 


3 


MemADS 


1 


1 


2 1 


2 


1 


5 


1 2 


3 


early 


72 


4 


MemADS 


2 


1 


2 1 


3 


1 


6 


1 2 


3 


early 


72 


5 


MemAD7 


2 


2 


2 1 


3 


2 


7 


1 3 


4 


early 


72 


6 


MemADS 


1 


1 


1 1 


1 


1 


30 


1 2 


3 


early 


t 


3 


MemADS 


1 


1 


2 1 


2 


1 


30 


2 5 


9 


early 


t 


4 


MemADIO 


2 


2 


2 2 


4 


2 


30 


2 3 


8 


late 


72 


7 


MemADII 


3 


3 


3 3 


3 


3 


30 


2 4 


13 


late 


72 


9 


MemAD12 


1 


1 


2 1 


2 


1 


4 


1 2 


3 


early 


72 


4 
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1 


2 1 


2 
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5 


1 2 


3 


early 


72 


5 


MemAD14 
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2 1 
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2 


6 


1 3 


4 
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72 


6 


MemADIS 
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1 


2 3 


3 


3 


8 


1 2 


3 


early 


72 


7 
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4 


4 4 


4 


4 


31 


30 30 


18 


late 


72 


12 



t Provided for static RAM only 
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Figure 7.9 IMS T425 internal configuration 
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Figure 7.10 IMS T425 internal configuration scan 

7.10.2 External configuration 

If MemConfig is held low until MemnotWrDO goes low the internal configuration is ignored and an external 
configuration will be loaded instead. An external configuration scan always follows an internal one, but if an 
internal configuration occurs any external configuration is ignored. 

The external configuration scan comprises 36 successive external read cycles, using the default EMI con- 
figuration preset by Mem AD31 . However, instead of data being read on the data bus as for a normal read 
cycle, only a single bit of data is read on MemConfig at each cycle. Addresses put out on the bus for each 
read cycle are shown in table 7.7, and are designed to address ROM at the top of the memory map. The 
table shows the data to be held in ROM; data required at the MemConfig pin is the inverse of this. 

MemConfig is typically connected via an inverter to MemnotWrDO. Data bit zero of the least significant byte 
of each ROM word then provides the configuration data stream. By switching MemConfig between various 
data bus lines up to 32 configurations can be stored in ROM, one per bit of the data bus. MemConfig can be 
permanently connected to a data line or to GND. Connecting MemConfig to GND gives all Tstates configured 
to four periods; notMemSI pulse of maximum duration; notMemS2-4 delayed by maximum; refresh interval 
72 periods of Clockln; refresh enabled; late write. 

The external memory configuration table 7.7 shows the contribution of each memory address to the 13 con- 
figuration fields. The lowest 12 words (#7FFFFF6G to #7FFFFF98, fields 1 to 6) define the number of extra 
periods Tm to be added to each Tstate. If field 2 is 3 then three extra periods will be added to T2 to extend 
it to the maximum of four periods. 

The next five addresses (field 7) define the duration of notMemSI and the following fifteen (fields 8 to 10) 
define the delays before strobes notMemS2-4 become active. The five bits allocated to each strobe allow 
durations of from to 31 periods Tm, as described in strobes page 290. 

Addresses #7FFFFFEC to #7FFFFFF4 (fields 1 1 and 12) define the refresh interval and whether refresh is to 
be used, whilst the final address (field 13) supplies a high bit to MemConfig if a late write cycle is required. 



The columns to the right of the coding table show the values of each configuration bit for the four sample 
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external configuration diagrams. Note the Inclusion of period E at the end of 16 in some diagrams. This is 
inserted to bring the start of the next Tstate T1 to coincide with a rising edge of ProcClockOut (page 288). 

Wait states W have been added to show the effect of them on strobe timing; they are not part of a configuration. 
In each case which includes wait states, two wait periods are defined. This shows that if a wait state would 
cause the start of 15 to coincide with a falling edge of ProcClocl<Out, another period Tm is generated by 
the EMI to force it to coincide with a rising edge of ProcClockOut. This coincidence is only necessary if wait 
states are added, otherwise coincidence with a falling edge Is permitted. Any configuration memory access 
is only permitted to be extended using wait, up to a total of 14 Clockln periods. 
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Figure 7.11 IMS T425 external configuration 
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Internal configuration 
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© Configuration field 1 ; T1 configured for 2 periods Tm 
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Figure 7.12 IMS T425 external configuration scan 
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Table 7.7 IMS T425 external configuration coding 



Scan 
cycle 


MemAD 
address 


Field 


Function 


Example diagram 


1 


2 


3 


4 


1 


7FFFFF6C 


1 


T1 least significant bit 














2 


7FFFFF70 


1 


T1 most significant bit 














3 


7FFFFF74 


2 


T2 least significant bit 


1 








1 


4 


7FFFFF78 


2 


T2 most significant bit 














5 


7FFFFF7C 


3 


T3 least significant bit 


1 


1 


1 


1 


6 


7FFFFF80 


3 


T3 most significant bit 














7 


7FFFFF84 


4 


T4 least significant bit 














8 


7FFFFF88 


4 


T4 most significant bit 














9 


7FFFFF8C 


5 


T5 least significant bit 














10 


7FFFFF90 


5 


T5 most significant bit 














11 


7FFFFF94 


6 


T6 least significant bit 


1 





1 


1 


12 


7FFFFF98 


6 


T6 most significant bit 














13 


7FFFFF9C 


7 


notMemSI least significant bit 








1 


1 


14 


7FFFFFA0 


7 
















15 


7FFFFFA4 


7 


* ^ 














16 


7FFFFFA8 


7 




1 











17 


7FFFFFAC 


7 


notMemSI most significant bit 














18 


7FFFFFB0 


8 


notMemS2 least significant bit 


1 








1 


19 


7FFFFFB4 


8 




1 


1 





1 


20 


7FFFFFB8 


8 


^ ^ 











1 


21 


7FI-hhFBC 


8 
















22 


7FFFFFC0 


8 


notMemS2 most significant bit 














23 


7FFFFFC4 


9 


notMemS3 least significant bit 


1 


1 


1 


1 


24 


7FFFFFC8 


9 







1 








25 


7FFFFFCC 


9 


^ ^ 





1 





1 


26 


7FFFFFD0 


9 










1 





27 


7FFFFFD4 


9 


notMemS3 most significant bit 














28 


7FFFFFD8 


10 


notMemS4 least significant bit 











1 


29 


7FFFFFDC 


10 







1 


1 


1 


30 


7FFFFFE0 


10 


^ ^ 


1 


1 








31 


7FFFFFE4 


10 
















32 


7FFFFFE8 


10 


notMemS4 most significant bit 














33 


7FFFFFEC 


11 


Refresh Interval least significant bit 


- 


- 


- 


- 


34 


7FFFFFF0 


11 


Refresh Interval most significant bit 


- 


- 


- 


- 


35 


7FFFFFF4 


12 


Refresh Enable 


- 


- 


- 


- 


36 


7FFFFFF8 


13 


Late Write 





1 


1 
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Table 7.8 IMS T425 memory refresh configuration coding 



Refresh 


Interval 


Field 11 


Complete 


interval 


in /xs 


encoding 


cycle (mS) 


18 


3.6 


00 


0.922 


36 


7.2 


01 


1.843 


54 


10.8 


10 


2.765 


72 


14.4 


11 


3.686 



Refresh intervals are in periods of Clockln and Clockln frequency is 5 MHz: 

Interval = 18 * 200 = 3600 ns 

Refresh interval is between successive incremental refresh addresses. 
Complete cycles are shown for 256 row DRAMS. 

Table 7.9 Memory configuration 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TMCVRdH 
TRdHMCX 
TSOLRdH 


Memory configuration data setup 
Memory configuration data hold 
notMemSO to configuration data read 


25 



a-12 




a+12 


ns 
ns 
ns 


1 



Notes 



1 a is 1 6 periods Tm. 



Tstate I T1 i T2 i T3 i T4 . T5 

"J^ I I I ll I I I I I I I I I I I I II 
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MemAD2-31 
notMemSO 

notMemRd 
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X. 
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Data 
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Figure 7.13 IMS T425 external configuration read cycle timing 
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7.1 1 RefreshPencfing 

When high, this pin signals that a refresh cycle is pending. It remains high until the refresh cycle is started by 
the transputer. The minimum time for the RefreshPending pin to be high is for one cycle of ProcClocl^Out 
(two periods Tm), when the EMI was not about to perform a memory read or write. If the EMI was held in 
the tristate condition with MemGranted asserted, then RefresliPending will be asserted when the refresh 
controller in the EMI is ready to perform a refresh. IVIemReq may be re-asserted any time after the com- 
mencement of the refresh cycle. RefreshPending changes state near the rising edge of ProcClockOut and 
can therefore be sampled by the falling edge of ProcClockOut. 

If no DMA is active then refresh will be performed following the end of the current internal or external memory 
cycle. If DMA is active the transputer will wait for DMA to terminate before commencing the refresh cycle. 
Unlike MemnotRfDI, RefreshPending is never tristated and can thus be interrogated by the DMA device; 
the DMA cycle can then be suspended, at the discretion of the DMA device, to allow refresh to take place. 

The simple circuit of Figure 7.14 will suspend DMA requests from the external logic when RefreshPending 
is asserted, so that a memory refresh cycle can be performed. DMA is restored on completion of the refresh 
cycle. The transputer will not perform an external memory cycle other than a refresh cycle, using this method, 
until the requesting device removes its DMA request. 
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Figure 7.14 IMS T425 refresh with DMA 
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Figure 7.15 IMS T425 RefreshPending timing 
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7.12 



notMemRf 



The IMS T425 can be operated with memory refresh enabled or disabled. The selection Is made during 
memory configuration, when the refresh interval is also determined. Refresh cycles do not interrupt internal 
memory accesses, although the internal addresses cannot be reflected on the external bus during refresh. 

When refresh is disabled no refresh cycles occur. During the post-Reset period eight dummy refresh cycles 
will occur with the appropriate timing but with no bus or strobe activity. 

A refresh cycle uses the same basic external memory timing as a normal external memory cycle, except that 
it starts two periods Tm before the start of T1. If a refresh cycle is due during an external memory access, 
it will be delayed until the end of that external cycle. Two extra periods Tm (periods R in the diagram) will 
then be inserted between the end of T6 of the external memory cycle and the start of T1 of the refresh cycle 
itself. The refresh address and various external strobes become active approximately one period Tm before 
T1 . Bus signals are active until the end of T2, whilst notMemRf remains active until the end of T6. 

For a refresh cycle, MemnotRfDI goes low before notMemRf goes low and MemnotWrDO goes high with 
the same timing as MemnotRfDI. All the address lines share the same timing, but only MemAD2-11 give 
the refresh address. Mem AD1 2-30 stay high during the address period, whilst MemAD31 remains low. 
Refresh cycles generate strobes notMemSO-4 with timing as for a normal external cycle, but notMemRd and 
notMemWrBO-3 remain high. MemWait operates normally during refresh cycles. 

Table 7.10 Memory refresh 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TRfLRfH 

TRaVSOL 

TRfLSOL 


Refresh pulse width low 

Refresh address setup before notMemSO 

Refresh indicator setup before notMemSO 


a-2 
b-12 
b-4 


b 


a+9 
b+6 


ns 
ns 
ns 


1 
2 



Notes 



1 a is total Tmx+Tm. 

2 b is total T1+Tm where T1 can be from one to four periods Tm in length. 
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Tstate 

normal cycle ~ 
MemAD2-31 - 


T4 T5 


T6 T1 T2 T3 


T4 T5 
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Figure 7.16 IMS T425 refresh cycle timing 



7.13 



MemWait 



Taking MemWait high with the timing shown will extend the duration of 14. MemWait is sampled relative 
to the falling edge of ProcClockOut during a T3 period, and should not change state in this region. By 
convention, notMemS4 is used to synchronize wait state insertion. If this or another strobe is used, its delay 
should be such as to take the strobe low an even number of periods Tm after the start of T1, to coincide with 
a rising edge of ProcClockOut. 

MemWait may be kept high indefinitely, although if dynamic memory refresh is used It should not be kept 
high long enough to interfere with refresh timing. MemWait operates normally during all cycles, including 
refresh and configuration cycles. It does not affect internal memory access in any way. 

If the start of T5 would coincide with a failing edge of ProcClockOut an extra wait period Tm (EW) is 
generated by the EMI to force coincidence with a rising edge. Rising edge coincidence is only forced if wait 
states are added, otherwise coincidence with a falling edge is permitted. 



7 External memory interface 



307 



Table 7.1 1 Memory wait 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPCLWtH 
TPCLWtL 
TWtLWtH 


Wait setup 

Wait hold 

Delay before re-assertion of Wait 


-(0.5Tm+9) 

0.5Tm+10 

2Tm 






ns 
ns 


1,2 
1,2 



Notes 



1 ProcClockOut load should not exceed 50pf. 

2 If wait period exceeds refresh interval, refresh cycles will be lost. 
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Figure 7.17 IMS T425 memory wait timing 
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7.14 MemReq, MemG ranted 

Direct memory access (DMA) can be requested at any time by taking the asynchronous MemReq input high. 
The transputer samples MemReq during the final period Tm of T6 of both refresh and external memory 
cycles. To guarantee taking over the bus immediately following either, MemReq must be set up at least two 
periods Tm before the end of T6. In the absence of an external memory cycle, MemReq is sampled during 
every low period of ProcClockOut. The address bus is tristated two periods Tm after the ProcCioclcOut 
rising edge which follows the sample. MemGranted Is asserted one period Tm after that. 

Removal of MemReq is sampled during each low period of ProcClockOut and MemGranted is removed 
synchronously with the next falling edge of ProcClockOut. If accurate timing of DMA is required, MemReq 
should be set low coincident with a falling edge of ProcClockOut. Further external bus activity, either refresh, 
external cycles or reflection of internal cycles, will commence at the next rising edge of ProcClockOut. 

Strobes are left in their inactive states during DMA. DMA cannot interrupt a refresh or external memory cycle, 
and outstanding refresh cycles will occur before the bus is released to DMA. DMA does not interfere with 
internal memory cycles in any way, although a program running in internal memory would have to wait for 
the end of DMA before accessing external memory. DMA cannot access internal memory. If DMA extends 
longer than one refresh interval (Memory Refresh Configuration Coding, table 7.8), the DMA user becomes 
responsible for refresh. DMA may also inhibit an internally running program from accessing external memory. 

DMA allows a bootstrap program to be loaded into external RAM ready for execution after reset. If MemReq is 
held high throughout reset, MemGranted will be asserted before the bootstrap sequence begins. MemReq 
must be high at least one period TDCLDCL of Clockln before Reset. The circuit should be designed to 
ensure correct operation if Reset could interrupt a normal DMA cycle. 

Table 7.12 Memory request 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TMRHMGH 
TMRLMGL 
TADZMGH 
TMGLADV 


Memory request response time 
Memory request end response time 
Bus tristate before memory granted 
Bus active after end of memory granted 


4Tm-2ns 
2Tm-2ns 
Tm-2ns 

-10ns 




7Tm+7ns 

5Tm+22ns 

Tm+22ns 

Tm+2ns 




1 



Notes 



1 These values assume no external memory cycle is in progress. If an external cycle Is active, maximum time 
could be (1 EMI cycle Tmx)+(1 refresh cycle TRfLRfH)+(6 periods Tm). 
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Figure 7.18 IMS T425 memory request timing 
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Figure 7.19 IMS T425 DMA sequence at reset 
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Figure 7.20 IMS T425 operation of MemReq, MemGranted with external, refresh memory cycles 
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Figure 7.21 IMS T425 operation of MemReq, MemGranted with external, internal memory cycles 
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8 Events 

EventReq and EventAck provide an asynchronous handshake interface between an external event and an 
internal process. When an external event takes EventReq high the external event channel (additional to the 
external link channels) is made ready to communicate with a process. When both the event channel and the 
process are ready the processor takes EventAck high and the process, if waiting, is scheduled. EventAck 
is removed after EventReq goes low. 

EventWaiting is asserted high by the transputer when a process executes an input on the event channel; 
typically with the occam event ? ANY instruction. It remains high whilst the transputer is waiting for or 
servicing EventReq and is returned low when EventAck goes high. The EventWaiting pin changes near the 
falling edge of ProcClockOut and can therefore be sampled by the rising edge of ProcClockOut. 

The EventWaiting pin can only be asserted by executing an in instruction on the event channel. The 
EventWaiting pin is not asserted high when an enable channel (enbc) instruction is executed on the Event 
channel (during an ALT construct in occam, for example). The EventWaiting pin can be asserted by executing 
the occam input on the event channel (such as Event ? ANY), provided that this does not occur as a 
guard in an alternative process. The EventWaiting pin can not be used to signify that an alternative process 
(ALT) is waiting on an input from the event channel. 

EventWaiting allows a process to control external logic; for example, to clock a number of inputs into a 
memory mapped data latch so that the event request type can be determined. This function is not available 
on the IMS T414 and IMS T800. 

Only one process may use the event channel at any given time. If no process requires an event to occur 
EventAck will never be taken high. Although EventReq triggers the channel on a transition from low to high, 
it must not be removed before EventAck is high. EventReq should be low during Reset; if not it will be 
ignored until it has gone low and returned high. EventAck is taken low when Reset occurs. 

If the process is a high priority one and no other high priority process is running, the latency is as described 
on page 269. Setting a high priority task to wait for an event input allows the user to interrupt a transputer 
program running at low priority. The time taken from asserting EventReq to the execution of the microcode 
interrupt handler in the CPU is four cycles. The following functions take place during the four cycles: 

Cycle 1 Sample EventReq at pad on the rising edge of ProcClockOut and synchronise. 

Cycle 2 Edge detect the synchronised EventReq and form the interrupt request. 

Cycle 3 Sample interrupt vector for microcode ROM in the CPU. 

Cycle 4 Execute the interrupt routine for Event rather than the next instruction. 
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Table 8.1 Event 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TVHKH 

TKHVL 

TVLKL 

TKLVH 

TKHEWL 

TKLEWH 


Event request response 

Event request hold 

Delay before removal of event acknowledge 

Delay before re-assertion of event request 

Event acknowledge to end of event waiting 

End of event acknowledge to event waiting 












6Tm+7ns 


ns 
ns 

ns 
ns 
ns 















EventReq / 


\ 


A 


/ 


TVHKH -^ 

EventAck y 

— >• 


^ TKHVL . 






-t- TVLKL 
^ TKLVH ^ 


* * 








/ 

h«- TKHEWL 




/~ 


^ TKLEWH ^ 




EventWaiting | 

Process waitin 


g for Event 






Event waiting for Process 



Figure 8.1 IMS T425 event timing 
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9 Links 

Four identical INMOS bi-directional serial links provide synchronized communication between processors 
and with the outside world. Each link comprises an input channel and output channel. A link between two 
transputers is implemented by connecting a link interface on one transputer to a link interface on the other 
transputer. Every byte of data sent on a link is acknowledged on the input of the same link, thus each signal 
line carries both data and control information. 

The quiescent state of a link output is low. Each data byte is transmitted as a high start bit followed by a one 
bit followed by eight data bits followed by a low stop bit. The least significant bit of data is transmitted first. 
After transmitting a data byte the sender waits for the acknowledge, which consists of a high start bit followed 
by a zero bit. The acknowledge signifies both that a process was able to receive the acknowledged data byte 
and that the receiving link is able to receive another byte. The sending link reschedules the sending process 
only after the acknowledge for the final byte of the message has been received. 

The IMS T425 links allow an acknowledge packet to be sent before the data packet has been fully received. 
This overlapped acknowledge technique is fully compatible with all other INMOS transputer links. 

The IMS T425 links support the standard INMOS communication speed of 10 Mbits/sec. In addition they can 
be used at 5 or 20 Mbits/sec. Links are not synchronised with Clockln or ProcClockOut and are insensitive 
to their phases. Thus links from independently clocked systems may communicate, providing only that the 
clocks are nominally identical and within specification. 

Links are TTL compatible and intended to be used in electrically quiet environments, between devices on a 
single printed circuit board or between two boards via a backplane. Direct connection may be made between 
devices separated by a distance of less than 300 millimetres. For longer distances a matched 100 ohm 
transmission line should be used with series matching resistors RM. When this is done the line delay should 
be less than 0.4 bit time to ensure that the reflection returns before the next data bit is sent. 

Buffers may be used for very long transmissions. If so, their overall propagation delay should be stable within 
the skew tolerance of the link, although the absolute value of the delay is immaterial. 

Link speeds can be set by LinkSpeclal, LInkOSpeclal and Link123Special. The link speed can be 
set independently. Table 9.1 shows uni-directional and bi-directional data rates in Kbytes/sec for each link 
speed; LinknSpeclal is to be read as LInkOSpeclal when selecting link speed and as Link123Speclal for 
the others. Data rates are quoted for a transputer using internal memory, and will be affected by a factor 
depending on the number of external memory accesses and the length of the external memory cycle. 

Table 9.1 Speed Settings for Transputer Links 



Link 
Special 


Linkn 
Special 


Mbits/sec 


Kbytes/sec | 


Uni 


Bi 




1 
1 



1 


1 


10 
5 
10 
20 


910 

450 

910 

1740 


1250 

670 

1250 

2350 





H L, 
1 Ack 1 


H'H 01234567 L, 


1 Data 1 



Figure 9.1 IMS T425 link data and acknowledge packets 
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Table 9.2 Link 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TJQr 


LinkOut rise time 






20 


ns 


1 


TJQf 


LinkOut fall time 






10 


ns 


1 


TJDr 


Linkin rise time 






20 


ns 


1 


TJDf 


Linkin fall time 






20 


ns 


1 


TJQJD 


Buffered edge delay 









ns 




TJBskew 


Variation in TJQJD 20 Mbits/s 






3 


ns 


2 




10 Mbits/s 






10 


ns 


2 




5 Mbits/s 






30 


ns 


2 


CLIZ 


Linkin capacitance @ f=l MHz 






7 


PF 


1 


CLL 


LinkOut load capacitance 






50 


PF 




RM 


Series resistor for lOOn transmission line 




56 




ohms 





Notes 



1 These paramters are sampled, but are not 100% tested. 

2 This is the variation in the total delay through buffers, transmission lines, differential receivers etc., caused by 
such things as short term variation in supply voltages and differences in delays for rising and falling edges. 
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Figure 9.2 IMS T425 link timing 
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Figure 9.3 IMS T425 buffered link timing 
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Figure 9.4 IMS T425 Links directly connected 
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Figure 9.5 IMS T425 Links connected by transmission line 
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Figure 9.6 IMS T425 Links connected by buffers 
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10 Electrical specifications 
10.1 DC electrical characteristics 



Table 10.1 Absolute maximum ratings 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


vcc 


DC supply voltage 





7.0 


V 


1.2,3 


VI, vo 


Voltage on input and output pins 


-0.5 


VCC+0.5 


V 


1,2.3 


II 


Input current 




±25 


mA 


4 


OSCT 


Output short circuit time (one pin) 




1 


s 


2 


TS 


Storage temperature 


-65 


150 


^C 


2 


TA 


Ambient temperature under bias 


-55 


125 


°C 


2 


PDmax 


Maximum allowable dissipation 




2 


W 





Notes 



1 All voltages are with respect to GND. 

2 This is a stress rating only and functional operation of the device at these or any other conditions beyond those 
indicated in the operating sections of this specification is not implied. Stresses greater than those listed may 
cause permanent damage to the device. Exposure to absolute maximum rating conditions for extended periods 
may affect reliability. 

3 This device contains circuitry to protect the inputs against damage caused by high static voltages or electrical 
fields. However, it is advised that normal precautions be taken to avoid application of any voltage higher than the 
absolute maximum rated voltages to this high impedance circuit. Unused inputs should be tied to an appropriate 
logic level such as VCC or GND. 

4 The input current applies to any input or output pin and applies when the voltage on the pin is between GND 
and vcc. 



Table 10.2 Operating conditions 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VCC 


DC supply voltage 


4.75 


5.25 


V 


1 


VI, VO 


Input or output voltage 





VCC 


V 


1.2 


CL 


Load capacitance on any pin 




60 


PF 




TA 


Operating temperature range IMS T425-S 





70 


°C 


3 


TA 


Operating temperature range IMS T425-M 


-55 


125 


°C 


3 



Notes 



1 All voltages are with respect to GND. 

2 Excursions beyond the supplies are permitted but not recommended; see DC characteristics. 

3 Air flow rate 400 linear ft/min transverse air flow. 
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Table 10.3 DC characteristics 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VIH 


High level input voltage 


2.0 


VGG+0.5 


V 


1,2 


VIL 


Low level input voltage 


-0.5 


0.8 


V 


1.2 


li 


Input current @ GND<VI<VCG 




±10 


/iA 


1,2 


VOH 


Output high voltage @ IOH=2mA 


VGG-1 




V 


1.2 


VOL 


Output low voltage @ IOL=4mA 




0.4 


V 


1,2 


108 


Output short circuit current @ GND<VO<VCC 


36 


65 


mA 


1,2.3.6 






65 


100 


mA 


1,2.4,6 


lOZ 


Tristate output current @ GND<VO<VCC 




±10 


/.A 


1,2 


PD 


Power dissipation 




1.0 


W 


2,5 


GIN 


Input capacitance @ f=1 MHz 




7 


PF 


6 


COZ 


Output capacitance @ f=l MHz 




10 


PF 


6 



Notes 



1 All voltages are with respect to GND. 

2 Parameters for IMS T425-S measured at 4.75V<VCC<5.25V and 0°C<TA<70°C. 
Input clock frequency = 5 MHz. 

3 Current sourced from non-link outputs. 

4 Current sourced from link outputs. 

5 Power dissipation varies with output loading and program execution. 
Power dissipation for processor operating at 20 MHz. 

6 This parameter is sampled and not 100% tested. 



10.2 Equivalent circuits 
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Diodes are 1N916 



Figure 10.1 Load circuit for AG measurements 
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Figure 10.2 AC measurements timing waveforms 
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Figure 10.3 Tristate load circuit for AC measurements 
10.3 AC timing characteristics 

Table 10.4 Input, output edges 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


TDr 


Input rising edges 


2 


20 


ns 


1.2 


TDf 


Input falling edges 


2 


20 


ns 


1.2 


TQr 


Output rising edges 




25 


ns 


1 


TQf 


Output falling edges 




15 


ns 


1 


TSOLaHZ 


Address high to tristate 


a 


a+6 


ns 


3 


TSOLaLZ 


Address low to tristate 


a 


a+6 


ns 


3 



Notes 



1 Non-link pins; see section on links, 

2 All inputs except Clockln; see section on Clockln. 



3 a is T2 where T2 can be from one to four periods Tm in length. 
Address lines include MemnotWrDO, MemnotRfDI, MemAD2-31. 
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Figure 10.4 IMS T425 input and output edge timing 
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Figure 10.5 IMS T425 tristate timing relative to notMemSO 
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Figure 10.6 Typical rise/fall times 



1 Skew is measured between notl\AemSO with a standard load (2 Schottky TTL inputs and SOpF) and 
notlVlemSO with a load of 2 Schottky TTL inputs and varying capacitance. 
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10.4 Power rating 

Internal power dissipation P/jvt of transputer and peripheral chips depends on VCC, as shown In figure 10.7. 
Pint is substantially independent of temperature. 

Total power dissipation Pd of the chip is 

Pd -^Pint + Pio 
where Pio is the power dissipation in the input and output pins; this is application dependent. 
Internal working temperature Tj of the chip is 

Tj^Ta+BJa^Pd 

where Ta is the external ambient temperature in °C and 6Ja is the junction-to-ambient thermal resistance in 
°C/W. BJa for each package Is given In the Packaging Specifications section. 
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Figure 10.7 IMS T425 internal power dissipation vs VCC 
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Figure 10.8 IMS T425 typical power dissipation with processor speed 
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Performance 



The performance of the transputer is measured in terms of the number of bytes required for the program, and 
the number of (internal) processor cycles required to execute the program. The figures here relate to OCCam 
programs. For the same function, other languages should achieve approximately the same performance as 
Occam. 



11.1 



Performance overview 



These figures are averages obtained from detailed simulation, and should be used only as an initial guide; 
they assume operands are of type INT. The abbreviations in table 11. 1 are used to represent the quantities 
indicated. In the replicator section of the table, figures in braces {} are not necessary if the number of 
replications is a compile time constant. To estimate performance, add together the time for the variable 
references and the time for the operation. 



Table 11.1 Key to performance table 



np 

ne 

r 

ts 

w 

P 

Eg 

Et 

Tb 

Tbp 

Tbc 

nsp 

nap 



number of component processes 

number of processes earlier in queue 

1 if INT parameter or array parameter, if not 

number of table entries (table size) 

width of constant in nibbles 

number of places to shift 

expression used in a guard 

timer expression used in a guard 

most significant bit set of multiplier ((-1) if the multiplier is 0) 

most significant bit set in a positive multiplier when counting from zero ((-1) if the multiplier is 0) 

most significant bit set in the two's complement of a negative multiplier 

Number of scalar parameters in a procedure 

Number of array parameters in a procedure 
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Table 1 1 .2 Performance 





Size (bytes) 


Time (cycles) 


Names 

variables 

in expression 
assigned to or input to 
in PROC or FUNCTION call, 
corresponding to an INT parameter 

channels 


1.1 +r 
1.1 +r 

1.1+r 
1.1 


2.1+2(r) 
1.1 +(r) 

1.1 +(r) 
2.1 


Array Variables (for single dimension arrays) 
constant subscript 
variable subscript 
expression subscript 




5.3 
5.3 




7.3 
7.3 


Declarations 

CHAN OF protocol 
[size] CHAN OF protocol 
PROC 


3.1 

9.4 

body+2 


3.1 

2.2 + 20.2*size 




Primitives 

assignment 

input 

output 

STOP 

SKIP 



4 
1 
2 





26.5 

26 

25 




Arithmetic operators 
+ 

* 

/ 

REM 

» « 


1 
2 
2 
2 
2 


1 

39 
40 
38 
3+p 


Modulo Arithmetic operators 
PLUS 
MINUS 

TIMES (fast multiply, positive operand) 
TIMES (fast multiply, negative operand) 


2 

1 
1 
1 


2 

1 
4+Tbp 
5+Tbc 


Boolean operators 
OR 
AND NOT 


4 
1 


8 
2 


Comparison operators 
= constant 
= variable 
<> constant 
<> variable 
> < 
>= <= 



2 

1 
3 
1 
2 


1 
3 
3 
5 
2 
4 


Bit operators 

A \/ X - 


2 


2 


Expressions 

constant in expression 
check if error 


w 

4 


w 

6 
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Table 1 1 .3 Performance 





Size (bytes) 


Time (cycles) 


Timers 






timer input 


2 


3 


timer after 






if past time 


2 


4 


witli empty timer queue 


2 


31 


non-empty timer queue 


2 


38+ne*9 


ALT (timer) 






witli empty timer queue 


6 


52 


non-empty timer queue 


6 


59+ne*9 


timer alt guard 


8+2Eg+2Et 


34+2Eg+2Et 


Constructs 






SEQ 








IF 


1.3 


1.4 


if guard 


3 


4.3 


ALT (non timer) 


6 


26 


alt channel guard 


10.2+2Eg 


20+2Eg 


skip alt guard 


8+2Eg 


10+2Eg 


PAR 


11.5+(np-1)*7.5 


19.5+(np-1)*30.5 


WHILE 


4 


12 


Procedure or function call 








3.5+(nsp-2)*1.1 


16.5+(nsp-2)*1.1 




+nap*2.3 


+nap*2.3 


Replicators 






replicated SEQ 


7.3{+5.1} 


(-3.8)+15.1*count{+7.1} 


replicated IF 


12.3{+5.1} 


(-2.6)+19.4*count{+7.1} 


replicated ALT 


24.8{+10.2} 


25.4+33.4*COunt{+14.2} 


replicated timer alt 


24.8{+10.2} 


62.4+33.4*count{+14.2} 


replicated PAR 


39.1 {+5.1} 


(-6.4)+70.9*count{+7.1} 



11.2 Fast multiply, TIMES 

The IMS T425 has a fast integer multiplication instruction product For a positive multiplier its execution time 
is 4+Tbp cycles, and for a negative multiplier 5+Tbc cycles (table 11.1). The time taken for a multiplication 
by zero is 3 cycles. 

Implementations of high level languages on the transputer may take advantage of this instruction. For example, 
the Occam modulo arithmetic operator TIMES is implemented by the instruction and the right-hand operand is 
treated as the multiplier. The fast multiplication instruction is also used in high level language implementations 
for the multiplication implicit in multi-dimensional array access. 



11.3 



Arithmetic 



A set of functions are provided within the development system to support the efficient implementation of 
multiple length integer arithmetic and floating point arithmetic. In table 11.4 n gives the number of places 
shifted and all arguments and results are assumed to be local. Full details of these functions are provided 
in the OCCam reference manual, supplied as part of the development system and available as a separate 
publication. 

When calculating the execution time of the predefined maths functions, no time needs to be added for calling 
overhead. These functions are compiled directly into special purpose instructions which are designed to 
support the efficient implementation of multiple length integer arithmetic and floating point arithmetic. 



11 Performance 



323 



Table 1 1 .4 Arithmetic performance 



Function 


Cycles 


+ cycles for 
parameter access 


t 


LONGADD 




2 


7 




LONGSUM 




3 


8 




LONGSUB 




2 


7 




LONGDIFF 




3 


8 




LONGPROD 




34 


8 




LONGDIV 




36 


8 




SHIFTRIGHT 


(n<32) 


4+n 


8 




SHIFTLRFT 
NORMALISE 


(n>=32) 
(n<32) 
(n>=32) 
(n<32) 


n-27 
4+n 
n-27 
n+6 


8 

7 




ASHIFTRIGHT 


(n>=32) 
(n=64) 


n-25 

4 

SHIFTRIGHT+2 


5 




ASHIFTLEFT 




SHIFTLEFT+4 


5 




ROTATERIGHT 




SHIFTRIGHT 


7 




ROTATELEFT 




SHIFTLEFT 


7 




FRACMaL 




LONGPROD+4 


5 





t Assuming local variables. 



11.4 Floating point operations 



Floating point operations for the IMS T425 are provided by a run-time package. This requires approximately 
400 bytes of memory for the single length arithmetic operations, and 2500 bytes for the double length arithmetic 
operations. Table 1 1 .5 summarizes the estimated performance of the package. 



Table 1 1 .5 IMS T425 floating point operations performance 





Processor cycles 


IMS 1425 


Typical 


Worst 


REAL32 + - 
* 

/ 

<> = >=<=<> 

REAL64 + - 

* 

/ 


230 
200 
245 
60 

565 

760 

1115 

60 


300 
240 
280 
60 

700 

940 

1420 

60 
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11.4.1 Special purpose functions and procedures 

The functions and procedures given in tables 1 1 .7 and 1 1 .8 are provided by tine development system to give 
access to the special instructions available on the IMS T425. Table 11.6 shows the key to the table. 



Table 1 1 .6 Key to special performance table 



Tb 
n 
r 
nr 



most significant bit set in the word counting from zero 
number of words per row (consecutive memory locations) 
number of rows in the two dimensional move 
number of bits to reverse 



Table 1 1 .7 Special purpose functions performance 



Function 


Cycles 


+ cycles for 
parameter access f 


BITCOUNT 


2+Tb 


2 


CRCBYTE 


11 


8 


CRCWORD 


35 


8 


BITREVNBIT 


5+nr 


4 


BITREVWORD 


36 


2 



t Assuming local variables. 



Table 1 1 .8 Special purpose procedures performance 



Procedure 


Cycles 


+ cycles for 
parameter access f 


M0VE2D 
DRAW2D 
CLIP2D 


8+(2n+23)*r 
8+(2n+23)*r 
8+(2n+23)*r 


8 
8 
8 



t Assuming local variables. 



11.5 Effect of external memory 

Extra processor cycles may be needed when program and/or data are held in external memory, depending 
both on the operation being performed, and on the speed of the external memory. After a processor cycle 
which Initiates a write to memory, the processor continues execution at full speed until at least the next 
memory access. 

Whilst a reasonable estimate may be made of the effect of external memory, the actual performance will 
depend upon the exact nature of the given sequence of operations. 

External memory is characterized by the number of extra processor cycles per external memory cycle, denoted 
as e. For the IMS T425, with the fastest external memory the value of e is 2; a typical value for a large external 
memory is 5. 

If program is stored in external memory, and e has the value 2 or 3, then no extra cycles need be estimated for 
linear code sequences. For larger values of e, the number of extra cycles required for linear code sequences 
may be estimated at (e-3)/4. A transfer of control may be estimated as requiring e+3 cycles. 
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These estimates may be refined for various constructs. In table 11 .9 n denotes the number of components in 
a construct. In the case of IF, the n'th conditional is the first to evaluate to TRUE, and the costs include the 
costs of the conditionals tested. The number of bytes in an array assignment or communication is denoted 
by b. 

Table 1 1 .9 External memory performance 





IMS T425 1 


Program off chip 


Data off chip 


Boolean expressions 


e-2 





IF 


3en-8 


en 


Replicated IF 


(6e-4)n+7 


(5e-2)n+8. 


Replicated SEQ 


(3e-3)n+2 


(4e-2)n 


PAR 


(3e-1)n+8 


3en+4 


Replicated PAR 


(10e-8)n+8 


16en-12 


ALT 


(2e-4)n+6e 


(2e-2)n+10e-8 


Array assignment and 





max (2e, e(b/2)) 


communication in 






one transputer 







The following simulation results illustrate the effect of storing program and/or data in external memory. The 
results are normalized to 1 for both program and data on chip. The first program (Sieve of Erastosthenes) 
is an extreme case as it is dominated by small, data access intensive loops; it contains no concurrency, 
communication, or even multiplication or division. The second program Is the pipeline algorithm for Newton 
Raphson square root computation. 



Table 11.10 IMS T425 external memory performance 





Program 


8=2 


8=3 


8=4 


8=5 


On chip 


Program off chip 


1 
2 


1.3 
1.1 


1.5 
1.2 


1.7 

1.2 


1.9 
1.3 




Data off chip 


1 
2 


1.5 

1.2 


1.8 
1.4 


2.1 
1.6 


2.3 

1.7 




Program and data off chip 


1 
2 


1.8 
1.3 


2.2 

1.6 


2.7 
1.8 


3.2 
2.0 





11.6 Interrupt latency 

If the process is a high priority one and no other high priority process is running, the latency is as described 
in table 11.11. The timings given are in full processor cycles TPCLPCL; the number of Tm states is also 
given where relevant. Maximum latency assumes all memory accesses are Internal ones. 



Table 11.11 Interrupt latency 





Typical 


Maximum 


TPCLPCL 


Tm 


TPCLPCL 


Tm 


IMS T425 


19 


38 


53 


116 
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Figure 12.1 IMS T425 84 pin grid array package pinout 
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Figure 12.2 84 pin grid array package dimensions 



Table 12.1 84 pin grid array package dimensions 



DIM 


[\/lillimetres 


Inches 


Notes 


NOIVI 


TOL 


NOM 


TOL 


A 


26.924 


±0.254 


1.060 


±0.010 




B 


17.019 


±0.127 


0.670 


±0.005 




C 


2.456 


±0.278 


0.097 


±0.011 




D 


4.572 


±0.127 


0.180 


±0.005 




E 


3.302 


±0.127 


0.130 


±0.005 




F 


0.457 


±0.025 


0.018 


±0.002 


Pin diameter 


G 


1.143 


±0.127 


0.045 


±0.005 


Flange diameter 


K 


22.860 


±0.127 


0.900 


±0.005 




L 


2.540 


±0.127 


0.100 


±0.005 




M 


0.508 




0.020 




Chamfer 



Package weight is approximately 7.2 grams 

Table 12.2 84 pin grid array package junction to ambient thermal resistance 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


^JA 


At 400 linear ft/min transverse air flow 






35 


°c/w 
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12.2 84 pin PLCC J-bend package 





W^m'SSaW O r- W n -K Srl 
OLU0COliIjQ.Ij>ZjIjIjIiIiIiI]IJLUOLUS 

T-OOSOOr^CDLO^COCSJf-'^COOJ-r-OOOOr^CDLO 
■r-T— COOOCOOOOOf^N'f^f^N' 

nnnnnnnnnnnnnnnnnnnnn 




CapMinus 12 c 

VCC 13 c: 

ProcSpeedSelectO 14 c 

GND ISC 

Errorin 1 6 c 

ProcSpeedSelect2 17 c 

Error 18 c 

BootFromROM 19 c: 
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Figure 12.3 IMS T425 84 pin PLCC J-bend package pinout 
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Figure 12.4 84 pin PLCC J-bend package dimensions 



Table 12.3 84 pin PLCC J-bend package dimensions 



DIIVI 


Millimetres 


Inches 


Notes 


NOM 


TOL 


NOM 


TOL 


A 


30.226 


±0.127 


1.190 


±0.005 




B 


29.312 


±0.127 


1.154 


±0.005 




C 


3.810 


±0.127 


0.150 


±0.005 




D 


0.508 


±0.127 


0.020 


±0.005 




F 


1.270 


±0.127 


0.050 


±0.005 




G 


0.457 


±0.127 


0.018 


±0.005 




J 


0.000 


±0.051 


0.000 


±0.002 




K 


0.457 


±0.127 


0.018 


±0.005 




L 


0.762 


±0.127 


0.030 


±0.005 





Package weight is approximately 7.0 grams 

Table 12.4 84 pin PLCC J-bend package junction to ambient thermal resistance 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


^JA 


At 400 linear ft/min transverse air flow 




35 




°c/w 
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12.3 84 lead quad cerpacl< paclcage 

The leads are unformed to allow the user to form them to specific requirements. 
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Figure 12.5 IMS T425 84 lead quad cerpack package pinout 



12 Package specifications 



331 































• 


< ^ > 








.- ,„... _ 















































^ >l 








-< 


N 






y ' 




u 


. u 






u 




3 


UU 


J 


UUL 


J 


A 



Section through 
package 



U 



M 



Figure 12.6 84 lead quad cerpack package dimensions 



DliVi 


Millimetres 


Inches 


Notes 


NOM 


TOL 


NOM 


TOL 


A 


38.100 


±0.508 


1.500 


±0.020 




B 


26.924 


±0.305 


1.060 


±0.012 




C 


20.574 


±0.203 


0.810 


±0.008 




D 


19.558 


±0.254 


0.770 


±0.010 




E 


0.508 




0.020 






F 


1.270 


±0.051 


0.050 


±0.002 




G 


2.489 


±0.305 


0.098 


±0.012 




H 


0.635 


±0.076 


0.025 


±0.003 




J 


1.143 


±0.102 


0.045 


±0.004 




K 


3.099 




0.122 




Max. 


L 


27.940 




1.100 




Max. 


M 


0.178 


±0.025 


0.007 


±0.001 





Table 12.5 84 lead quad cerpack package dimensions 
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13 Ordering 

Tliis section indicates the designation of speed and package selections for the various devices. Speed of 
Clockln is 5 MHz for all parts. Transputer processor cycle time is nominal; it can be calculated more exactly 
using the phase lock loop factor PLLx, as detailed in the external memory section. 

For availability contact local INMOS sales office or authorised distributor. 

Table 13.1 IMS T425 ordering details 



INMOS 
designation 


Processor 
clock speed 


Processor 
cycle time 


PLLx 


Package 


IMS T425-G17S 
IMS T425-G20S 
IMS T425-G25S 
IMS T425-G30S 


17.5 MHz 
20.0 MHz 
25.0 MHz 
30.0 MHz 


57 ns 
50 ns 
40 ns 
33 ns 


3.5 
4.0 
5.0 
6.0 


Ceramic Pin Grid 
Ceramic Pin Grid 
Ceramic Pin Grid 
Ceramic Pin Grid 


IMS T425-J17S 
IMS T425-J20S 


17.5 MHz 
20.0 MHz 


57 ns 
50 ns 


3.5 
4.0 


Plastic PLCC J-Bend 
Plastic PLCC J-Bend 


IMS T425-G17M 
IMS T425-G20M 


17.5 MHz 
20.0 MHz 


57 ns 
50 ns 


3.5 
4.0 


Ceramic Pin Grid MIL Spec 
Ceramic Pin Grid MIL Spec 


IMS T425-Q17M 
IMS T425-Q20M 


17.5 MHz 
20.0 MHz 


57 ns 
50 ns 


3.5 
4.0 


Quad Cerpack MIL Spec 
Quad Cerpack MIL Spec 



The timing parameters in this datasheet are based on 17 MHz and 20 MHz parts. Data for higher speeds is 
based on tests on a limited number of samples and may change when full characterisation is completed. 
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IMST414 

transputer 



DOiimos 



The IMS T425 
is recommended 
for new designs 



Engineering Data 



FEATURES 

32 bit architecture 

50 ns internal cycle time 

20 MIPS (peak) instruction rate 

IMS T414-20 is pin compatible with the IMS T805-20, 

IMS T800-20 and IMS T425-20 
2 Kbytes on-chip static RAM 
80 Mbytes/sec sustained data rate to internal memory 
4 Gbytes directly addressable external memory 
26 Mbytes/sec sustained data rate to external memory 
950 ns response to interrupts 
Four INMOS serial links 5/10/20 Mbits/sec 
Bi-directional data rate of 1.6 Mbytes/sec per link 
Internal timers of 1 fis and 64 fis 
Boot from ROM or communication links 
Single 5 MHz clock input 
Single +5V ±5% power supply 

APPLICATIONS 

High speed multi processor systems 

Real time processing 

Microprocessor applications 

Workstations and workstation clusters 

Image processing 

Graphics processing 

Accelerator processors 

Distributed databases 

Supercomputers 

System simulation 

Digital signal processing 

Telecommunications 

Robotics 

Fault tolerant systems 

Medical instrumentation 

Pattern recognition 

Artificial intelligence 
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1 Introduction 

The IMS T414 transputer is a 32 bit CMOS microcomputer with 2 Kbytes on-chip RAM for high speed 
processing, a configurable memory interface and four standard INMOS communication links. The instruction 
set achieves efficient implementation of high level languages and provides direct support for the OCCam 
model of concurrency when using either a single transputer or a network. Procedure calls, process switching 
and typical interrupt latency are sub-microsecond. The IMS T414 provides high performance arithmetic 
and microcode support for floating point operations. A device running at 20 MHz achieves an Instruction 
throughput of 10 MIPS. 

For convenience of description, the IMS T414 operation is split into the basic blocks shown in figure 1 .1 . 



vcc 

GND 

CapPlus 

CapMinus 

Reset 

Analyse 

Error 

BootFromROM 

Clockln 





System 
services 





















DisableintRam 



ProcClockOut -^ 

notMemSO-4 -*- 

notMemWrBO-3 -<- 

notMemRd -*- 

notMemRf -*- 



MemWait - 

MemConfig ■ 

MemReq ■ 

Mem Granted - 



Timers 



2k bytes 

of 

On-chip 

RAM 



32 

N~^ — ^ 



External 
Memory 
Interface 



32 



32 



32 bit 
Processor 



32 



32 



32 



32 



Link 
Services 



Link 



V Interface 



Link 



V Interface 



Link 



V Interface 



Link 
Interface 



Event 



32 



LinkSpecial 

LinkOSpecial 

Link123Special 

LinklnO 



-*- LinkOutO 



Linkim 



-^ LinkOutI 



Link[n2 
LlnkOut2 

LinklnS 
LinkOutS 

EventReq 
EventAck 



MemnotWrDO 

MemnotRfDI 

MemAD2-31 



Figure 1 .1 IMS T414 block diagram 
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The IMS T414 can directly access a linear address space of 4 Gbytes. The 32 bit wide memory interface 
uses multiplexed data and address lines and provides a data rate of up to 4 bytes every 150 nanoseconds 
(26.6 Mbytes/sec) for a 20 MHz device. A configurable memory controller provides all timing, control and 
DRAM refresh signals for a wide variety of mixed memory systems. 

System Services include processor reset and bootstrap control, together with facilities for error analysis. 

The INMOS communication links allow networks of transputer family products to be constructed by direct 
point to point connections with no external logic. The IMS T414 links support the standard operating speed 
of 10 Mbits/sec, but also operate at 5 or 20 Mbits/sec. 

The IMS T414 is designed to implement the OCCam language, detailed in the OCCam Reference Manual, but 
also efficiently supports other languages such as C, Pascal and Fortran. Access to the transputer at machine 
level is seldom required, but if necessary refer to the Transputer Instruction Set - A Compiler Writers' Guide. 

This data sheet supplies hardware implementation and characterisation details for the IMS T414. It is intended 
to be read in conjunction with the Transputer Architecture chapter, which details the architecture of the 
transputer and gives an overview of occam. 
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Pin designations 



Table 2.1 IMS T414 system services 



Pin 


In/Out 


Function 


VCC, GND 




Power supply and return 


CapPlus, CapMlnus 




External capacitor for internal clock power supply 


Clocl<ln 


in 


Input clock 


Reset 


in 


System reset 


Error 


out 


Error indicator 


Analyse 


in 


Error analysis 


BootFromRom 


in 


Bootstrap from external ROM or from link 


DisablelntRAM 


in 


Disable internal RAM 


HoldToGND 




Must be connected to GND 


DoNotWire 




Must not be wired 



Table 2.2 IMS T414 external memory interface 



Pin 


In/Out 


Function 


ProcClockOut 


out 


Processor clock 


MemnotWrDO 
IVIemnotRfDI 
IVlemAD2-31 
notMemRd 


in/out 

in/out 

in/out 

out 


Multiplexed data bit and write cycle warning 
Multiplexed data bit 1 and refresh warning 
Multiplexed data and address bus 
Read strobe 


notMemWrBO-3 
notMemSO-4 
notMemRf 
MemWait 


out 
out 
out 
in 


Four byte-addressing write strobes 
Five general purpose strobes 
Dynamic memory refresh indicator 
Memory cycle extender 


MemReq 
MemG ranted 
MemConfig 


in 

out 
in 


Direct memory access request 
Direct memory access granted 
Memory configuration data input 



Table 2.3 IMS T41 4 event 



Pin 


In/Out 


Function 


Event Req 
EventAck 


in 
out 


Event request 

Event request acknowledge 



Table 2.4 IMS T41 4 link 



Pin 


In/Out 


Function 


LinklnO-3 


in 


Four serial data input channels 


LinkOutO-3 


out 


Four serial data output channels 


LinkSpecial 


in 


Select non-standard speed as 5 or 20 Mbits/sec 


LinkOSpecial 


in 


Select special speed for Link 


Link123Special 


in 


Select special speed for Links 1,2,3 



Signal names are prefixed by not if they are active low, otherwise they are active high. 
Pinout details for various packages are given on page 393. 
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3 Processor 

The 32 bit processor contains instruction processing logic, instruction and work pointers, and an operand 
register. It directly accesses the high speed 2 Kbyte on-chip memory, which can store data or program. 
Where larger amounts of memory or programs in ROM are required, the processor has access to 4 Gbytes 
of memory via the External Memory Interface (EMI). 



3.1 



Registers 



The design of the transputer processor exploits the availability of fast on-chip memory by having only a small 
number of registers; six registers are used in the execution of a sequential process. The small number of 
registers, together with the simplicity of the instruction set, enables the processor to have relatively simple 
(and fast) data-paths and control logic. The six registers are: 

The workspace pointer which points to an area of store where local variables are kept. 

The instruction pointer which points to the next instruction to be executed. 

The operand register which is used in the formation of instruction operands. 

The A, B and C registers which form an evaluation stack. 

A, B and C are sources and destinations for most arithmetic and logical operations. Loading a value into the 
stack pushes B into C, and A into S, before loading A. Storing a value from A, pops B into A and C into B. 

Expressions are evaluated on the evaluation stack, and instructions refer to the stack implicitly. For example, 
the aGfd instruction adds the top two values in the stack and places the result on the top of the stack. The use of 
a stack removes the need for instructions to respecify the location of their operands. Statistics gathered from a 
large number of programs show that three registers provide an effective balance between code compactness 
and implementation complexity. 

No hardware mechanism is provided to detect that more than three values have been loaded onto the stack. 
It is easy for the compiler to ensure that this never happens. 

Any location in memory can be accessed relative to the workpointer register, enabling the workspace to be 
of any size. 

Further register details are given in Transputer Instruction Set - A Compiler Writers' Guide. 





Registers 




Locals 




Program 






A 








B 






C 






Workspace 








Next Inst 












Operand 



















Figure 3.1 Registers 
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3.2 



Instructions 



The instruction set has been designed for simple and efficient compilation of high-level languages. All in- 
structions have the same format, designed to give a compact representation of the operations occurring most 
frequently in programs. 

Each instruction consists of a single byte divided into two 4-bit parts. The four most significant bits of the byte 
are a function code and the four least significant bits are a data value. 





Function Data 


7 4 3 

1 





Operand Register 





Figure 3.2 Instruction format 



3.2.1 Direct functions 



The representation provides for sixteen functions, each with a data value ranging from to 15. Ten of these, 
shown in table 3.1, are used to encode the most important functions. 

Table 3.1 Direct functions 



load constant 


add constant 




load local 


store local 


load local pointer 


load non-local 


store non-local 




jump 


conditional jump 


call 



The most common operations in a program are the loading of small literal values and the loading and storing 
of one of a small number of variables. The load constant instruction enables values between and 15 to be 
loaded with a single byte instruction. The load local and store local instructions access locations in memory 
relative to the workspace pointer. The first 16 locations can be accessed using a single byte instruction. 

The load non-local and store non-local instructions behave similarly, except that they access locations in 
memory relative to the A register. Compact sequences of these instructions allow efficient access to data 
structures, and provide for simple implementations of the static links or displays used in the implementation 
of high level programming languages such as Occam, C, Fortran, Pascal or ADA. 



3.2.2 



Prefix functions 



Two more function codes allow the operand of any instruction to be extended in length; prefix and negative 
prefix. 

All instructions are executed by loading the four data bits into the least significant four bits of the operand 
register, which is then used as the instruction's operand. All instructions except the prefix instructions end by 
clearing the operand register, ready for the next instruction. 

The prefix instruction loads its four data bits into the operand register and then shifts the operand register up 
four places. The negative prefix instruction is similar, except that it complements the operand register before 
shifting it up. Consequently operands can be extended to any length up to the length of the operand register 
by a sequence of prefix instructions. In particular, operands in the range -256 to 255 can be represented 
using one prefix instruction. 



Processor 
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The use of prefix instructions has certain beneficial consequences. Firstly, they are decoded and executed 
in the same way as every other instruction, which simplifies and speeds instruction decoding. Secondly, they 
simplify language compilation by providing a completely uniform way of allowing any instruction to take an 
operand of any size. Thirdly, they allow operands to be represented in a form independent of the processor 
wordlength. 

3.2.3 Indirect functions 

The remaining function code, operate, causes its operand to be interpreted as an operation on the values 
held in the evaluation stack. This allows up to 16 such operations to be encoded in a single byte instruction. 
However, the prefix instructions can be used to extend the operand of an operate instruction just like any 
other. The instruction representation therefore provides for an indefinite number of operations. 

Encoding of the indirect functions Is chosen so that the most frequently occurring operations are represented 
without the use of a prefix instruction. These include arithmetic, logical and comparison operations such as 
add, exclusive or and greater than. Less frequently occurring operations have encodings which require a 
single prefix operation. 

3.2.4 Expression evaluation 

Evaluation of expressions sometimes requires use of temporary variables in the workspace, but the number 
of these can be minimised by careful choice of the evaluation order. 

Table 3.2 Expression evaluation 



Program 


IVInemonic 


X :=0 


Idc 

StI X 


X := #24 


pfix 2 
Idc 4 

StI X 


X := y + z 


Idl y 
Idl z 
add 

StI X 



3.2.5 Efficiency of encoding 

Measurements show that about 70% of executed instructions are encoded in a single byte; that is, without 
the use of prefix instructions. Many of these instructions, such as load constant and add require just one 
processor cycle. 

The instruction representation gives a more compact representation of high level language programs than 
more conventional instruction sets. Since a program requires less store to represent it, less of the memory 
bandwidth is taken up with fetching instructions. Furthermore, as memory is word accessed the processor 
will receive four instructions for every fetch. 

Short instructions also improve the effectiveness of instruction pre-fetch, which in turn improves processor 
performance. There is an extra word of pre-fetch buffer, so the processor rarely has to wait for an instruction 
fetch before proceeding. Since the buffer Is short, there is little time penalty when a jump instruction causes 
the buffer contents to be discarded. 
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3.3 



Processes and concurrency 



A process starts, performs a number of actions, and then either stops without completing or terminates 
complete. Typically, a process is a sequence of instructions. A transputer can run several processes in 
parallel (concurrently). Processes may be assigned either high or low priority, and there may be any number 
of each (page 341). 

The processor has a microcoded scheduler which enables any number of concurrent processes to be exe- 
cuted together, sharing the processor time. This removes the need for a software kernel. 



At any time, a concurrent process may be 

Active 

Inactive - 



Being executed. 

On a list waiting to be executed. 

Ready to input. 

- Ready to output. 

- Waiting until a specified time. 

The scheduler operates in such a way that inactive processes do not consume any processor time. It allocates 
a portion of the processor's time to each process in turn. Active processes waiting to be executed are held 
in two linked lists of process workspaces, one of high priority processes and one of low priority processes 
(page 341). Each list is implemented using two registers, one of which points to the first process in the list, 
the other to the last, in the Linked Process List figure 3.3, process S is executing and P, Q and R are active, 
awaiting execution. Only the low priority process queue registers are shown; the high priority process ones 
perform in a similar manner. 
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Figure 3.3 Linked process list 



Table 3.3 Priority queue control registers 



Function 


High Priority 


Low Priority 


Pointer to front of active process list 
Pointer to back of active process list 


FptrO 
BptrO 


Fptn 
Bptn 



Each process runs until it has completed its action, but is descheduled whilst waiting for communication from 
another process or transputer, or for a time delay to complete. In order for several processes to operate in 
parallel, a low priority process is only permitted to run for a maximum of two time slices before it is forcibly 
descheduled at the next descheduling point (page 344). The time slice* period is 5120 cycles of the external 
5 MHz clock, giving ticks approximately 1 ms apart. 
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A process can only be descheduled on certain instructions, known as desclieduling points (page 344). As a 
result, an expression evaluation can be guaranteed to execute without the process being timesliced partway 
through. 

Whenever a process is unable to proceed, its instruction pointer is saved in the process workspace and 
the next process taken from the list. Process scheduling pointers are updated by instructions which cause 
scheduling operations, and should not be altered directly. Actual process switch times are less than 1 /iS, as 
little state needs to be saved and it is not necessary to save the evaluation stack on rescheduling. 

The processor provides a number of special operations to support the process model, including start process 
and end process. When a main process executes a parallel construct, start process instructions are used 
to create the necessary additional concurrent processes. A start process instruction creates a new process 
by adding a new workspace to the end of the scheduling list, enabling the new concurrent process to be 
executed together with the ones already being executed. When a process is made active it is always added 
to the end of the list, and thus cannot pre-empt processes already on the same list. 

The correct termination of a parallel construct is assured by use of the end process instruction. This uses 
a workspace location as a counter of the parallel construct components which have still to terminate. The 
counter is initialised to the number of components before the processes are started. Each component ends 
with an end process instruction which decrements and tests the counter. For all but the last component, the 
counter is non zero and the component is descheduled. For the last component, the counter is zero and the 
main process continues. 

3.4 Priority 

The IMS T414 supports two levels of priority. Priority 1 (low priority) processes are executed whenever there 
are no active priority (high priority) processes. 

High priority processes are expected to execute for a short time. If one or more high priority processes are 
able to proceed, then one is selected and runs until it has to wait for a communication, a timer input, or until 
it completes processing. 

If no process at high priority is able to proceed, but one or more processes at low priority are able to proceed, 
then one is selected. 

Low priority processes are periodically timesliced to provide an even distribution of processor time between 
computationally intensive tasks. 

If there are n low priority processes, then the maximum latency from the time at which a low priority process 
becomes active to the time when it starts processing is 2n-2 timeslice periods. It is then able to execute for 
between one and two timeslice periods, less any time taken by high priority processes. This assumes that 
no process monopolises the transputer's time; i.e. it has a distribution of descheduling points (page 344). 

Each timeslice period lasts for 5120 cycles of the external 5 MHz input clock (approximately 1 ms at the 
standard frequency of 5 MHz). 

If a high priority process is waiting for an external channel to become ready, and if no other high priority 
process is active, then the interrupt latency (from when the channel becomes ready to when the process 
starts executing) is typically 19 processor cycles, a maximum of 58 cycles (assuming use of on-chip RAM). 

3.5 Communications 

Communication between processes is achieved by means of channels. Process communication is point-to- 
point, synchronised and unbuffered. As a result, a channel needs no process queue, no message queue and 
no message buffer. 

A channel between two processes executing on the same transputer is implemented by a single word in 
memory; a channel between processes executing on different transputers is implemented by point-to-point 
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links. The processor provides a number of operations to support message passing, the most important being 
input message and output message. 

The input message and output message instructions use the address of the channel to determine whether 
the channel is internal or external. Thus the same instruction sequence can be used for both, allowing a 
process to be written and compiled without knowledge of where its channels are connected. 

The process which first becomes ready must wait until the second one is also ready. A process performs an 
input or output by loading the evaluation stack with a pointer to a message, the address of a channel, and 
a count of the number of bytes to be transferred, and then executing an input message or output message 
instruction. Data is transferred if the other process is ready. If the channel is not ready or is an external one 
the process will deschedule. 



3.6 



Timers 



The transputer has two 32 bit timer clocks which 'tick' periodically. The timers provide accurate process 
timing, allowing processes to deschedule themselves until a specific time. 

One timer is accessible only to high priority processes and is incremented every microsecond, cycling com- 
pletely in approximately 4295 seconds. The other is accessible only to low priority processes and is incre- 
mented every 64 microseconds, giving exactly 15625 ticks in one second. It has a full period of approximately 
76 hours. 



Table 3.4 Timer registers 



CiockO 
Clockl 
TNextRegO 
TNextRegl 



Current value of high priority (level 0) process clock 

Current value of low priority (level 1) process clock 

Indicates time of earliest event on high priority (level 0) timer queue 

Indicates time of earliest event on low priority (level 1) timer queue 



The current value of the processor clock can be read by executing a load timer instruction. A process can 
arrange to perform a timer input, in which case it will become ready to execute after a specified time has 
been reached. The //mer /npL/rmstruction requires a time to be specified. If this time Is in the 'past' then the 
instruction has no effect. If the time is in the 'future' then the process is descheduled. When the specified 
time is reached the process is scheduled again. 

Figure 3.4 shows two processes waiting on the timer queue, one waiting for time 21 , the other for time 31 . 
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Figure 3.4 Timer registers 
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4 Instruction set summary 

The Function Codes table 4.7. gives the basic function code set (page 338). Where the operand is less 
than 16, a single byte encodes the complete instruction. If the operand is greater than 15, one prefix 
instruction {pfix) is required for each additional four bits of the operand. If the operand is negative the first 
prefix instruction will be nfix. 

Table 4.1 pref/x coding 









Function 


Memory 


Mnemonic 


code 


code 


Idc 




#3 


#4 


#43 


Idc 




#35 






is coded 


as 








pfix 




#3 


#2 


#23 


Idc 




#5 


#4 


#45 


Idc 




#987 






is coded 


as 








pfix 




#9 


#2 


#29 


pfix 




#8 


#2 


#28 


Idc 




#7 


#4 


#47 


Idc 




-31 {Idc 


#FFFFFFE1) 




is coded 


as 








nfix 




#1 


#6 


#61 


Idc 




#1 


#4 


#41 



Tables 4.8 to 4.18 give details of the operation codes. Where an operation code is less than 16 (e.g. add: 
operation code 05), the operation can be stored as a single byte comprising the operate function code F and 
the operand (5 in the example). Where an operation code is greater than 15 (e.g. ladd: operation code 16), 
the pre//x function code 2 is used to extend the instruction. 



Table 4.2 operate coding 



Mnemonic 



Function 
code 



Memory 
code 



add 
is coded as 

opr 

ladd 
is coded as 

pfix 
opr 



(op. code #5) 

add 

(op. code #1 6) 

#1 
#6 



#F 



#2 
#F 



#F5 

#F5 

#21 F6 

#21 
#F6 



The Processor Cycles column refers to the number of periods TPCLPCL taken by an instruction executing 
In internal memory. The number of cycles is given for the basic operation only; where the memory code 
for an instruction is two bytes, the time for the prefix function (one cycle) should be added. For a 20 MHz 
transputer one cycle is 50 ns. Some instruction times vary. Where a letter is included in the cycles column it 
is interpreted from table 4.3. 
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Table 4.3 Instruction set interpretation 



Ident 



Interpretation 



b 
n 
w 



Bit number of the highest bit set in register A. Bit is the least significant bit. 

Number of places shifted. 

Number of words in the message. Part words are counted as full words. If the message 
is not word aligned the number of words is increased to include the part words at either 
end of the message. 



The DE column of the tables indicates the descheduling/error features of an instruction as described in 
table 4.4. 

Table 4.4 Instruction features 



Ident 


Feature 


See page: 


D 

E 


The instruction is a descheduling point 
The instruction will affect the Error flag 


344 
344, 354 



4.1 



Descheduling points 



The instructions in table 4.5 are the only ones at which a process may be descheduled (page 340). They are 
also the ones at which the processor will halt if the Analyse pin is asserted (page 353). 



Table 4.5 Descheduling point instructions 



input message 
timer alt wait 
jump 



output message 
timer input 
loop end 



output byte 
stop on error 
end process 



output word 
alt wait 
stop process 



4.2 Error instructions 

The instructions in table 4.6 are the only ones which can affect the Error flag (page 354) directly. 



Table 4.6 Error setting instructions 



add 


add constant 


subtract 




multiply 


fractional multiply 


divide 


remainder 


long add 


long subtract 


long divide 




set error 


testerr 




cflerr 


check word 


check subscript from 


check single 


check count from 1 



4 Instruction set summary 



345 



Table 4.7 IMS T414 function codes 



Function 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 





OX 


J 


3 


jump 


D 


1 


1X 


Idip 


1 


load local pointer 




2 


2X 


pfix 


1 


prefix 




3 


3X 


Idnl 


2 


load non-local 




4 


4X 


Idc 


1 


load constant 




5 


5X 


Idnip 


1 


load non-local pointer 




6 


6X 


nfix 


1 


negative prefix 




7 


7X 


Idl 


2 


load local 




8 


8X 


adc 


1 


add constant 


E 


9 


9X 


call 


7 


call 




A 


AX 


cj 


2 

4 


conditional jump (not taken) 
conditional jump (taken) 




B 


BX 


ajw 


1 


adjust workspace 




C 


OX 


eqc 


2 


equals constant 




D 


DX 


sti 


1 


store local 




E 


EX 


stnl 


2 


store non-local 




F 


FX 


opr 


- 


operate 





Table 4.8 IMS T414 arithmetic/logical operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


46 


24F6 


and 


1 


and 




4B 


24FB 


or 


1 


or 




33 


23F3 


xor 


1 


exclusive or 




32 


23F2 


not 


1 


bitwise not 




41 


24F1 


shI 


n+2 


shift left 




40 


24F0 


shr 


n+2 


shift right 




05 


F5 


add 


1 


add 


E 


OC 


FC 


sub 


1 


subtract 


E 


53 


25F3 


mul 


38 


multiply 


E 


72 


27F2 


fmul 


35 


fractional multiply (no rounding) 


E 








40 


fractional multiply (rounding) 


E 


20 


22FC 


div 


39 


divide 


E 


1F 


21 FF 


rem 


37 


remainder 


E 


09 


F9 


gt 


2 


greater than 




04 


F4 


diff 


1 


difference 




52 


25F2 


sum 


1 


sum 




08 


F8 


prod 


b+4 


product 
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Table 4.9 IMS T414 long arithmetic operation codes 




Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


16 


21 F6 


ladd 


2 


long add 


E 


38 


23F8 


Isub 


2 


long subtract 


E 


37 


23F7 


Isum 


3 


long sum 




4F 


24FF 


Idiff 


3 


long diff 




31 


23F1 


Imul 


33 


long multiply 




1A 


21 FA 


Idiv 


35 


long divide 


E 


36 


23F6 


IshI 


n+3 
n-28 


long shift left (n<32) 
long shift left(n>32) 




35 


23F5 


Ishr 


n+3 
n-28 


long shift right (n<32) 
long shift right (n>32) 




19 


21 F9 


norm 


n+5 
n-26 

3 


normalise (n<32) 
normalise (n>32) 
normalise (n=64) 





Table 4.10 IMS T414 floating point support operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 
E 


73 
63 
6D 
60 

71 


27F3 
26F3 
26FD 
26FG 

27F1 


cflerr 
unpacksn 
roundsn 
postnormsn 

Idinf 


3 

15 
12/15 
5/30 

1 


check floating point error 
unpack single length fp number 
round single length fp number 
post-normalise correction of 
single length fp number 

load single length infinity 


E 



Processor cycles are shown as Typical/Maximum cycles. 



Table 4.1 1 IMS T414 general operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


00 

3A 
56 
ID 
40 
42 


FO 

23FA 
25F6 
21 FD 
24FC 
24F2 


rev 

xword 
cword 
xdble 
csngI 
mint 


1 

4 
5 
2 
3 

1 


reverse 

extend to word 
check word 
extend to double 
check single 
minimum integer 


E 
E 
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Table 4.12 IMS T414 indexing/array operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


02 
OA 
34 
3F 


F2 

FA 

23F4 

23FF 


bsub 
wsub 
bent 
went 


1 
2 
2 
5 


byte subscript 
word subscript 
byte count 
word count 




01 
3B 


F1 
23FB 


lb 
sb 


5 
4 


load byte 
store byte 




4A 


24FA 


move 


2w+8 


move message 





Table 4.13 IMS T414 timer handling operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


22 


22F2 


Idtimer 


2 


load timer 




2B 
4E 


22FB 
24FE 


tin 
talt 


30 
4 
4 


timer Input (time future) 
timer input (time past) 
timer alt start 


D 
D 


51 
47 


25F1 
24F7 


taltwt 
enbt 


15 
48 
8 


timer alt wait (time past) 
timer alt wait (time future) 
enable timer 


D 
D 


2E 


22FE 


dist 


23 


disable timer 





Table 4.14 IMS T414 input/output operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


07 
OB 
OF 
OE 


F7 
FB 
FF 
FE 


in 

out 

outword 
outbyte 


2W+19 
2W+19 

23 

23 


input message 
output message 
output word 
output byte 


D 
D 
D 
D 


43 


24F3 


alt 


2 


alt start 




44 
45 


24F4 
24F5 


altwt 
altend 


5 

17 

4 


alt wait (channel ready) 
alt wait (channel not ready) 
alt end 


D 
D 


49 
30 


24F9 
23F0 


enbs 
diss 


3 

4 


enable skip 
disable skip 




12 


21 F2 


resetch 


3 


reset channel 




48 
2F 


24F8 
22FF 


enbc 
disc 


7 
5 
8 


enable channel (ready) 
enable channel (not ready) 
disable channel 
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Table 4.15 IMS T414 control operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


20 
1B 
3C 
06 
21 


22F0 
21 FB 
23FC 

F6 
22F1 


ret 

Idpi 

gajw 

gcall 

lend 


5 
2 
2 

4 
10 
5 


return 

load pointer to instruction 

general adjust workspace 

general call 

loop end (loop) 

loop end (exit) 


D 
D 



Table 4.16 IMS T414 scheduling operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


OD 


FD 


startp 


12 


start process 


D 


03 


F3 


endp 


13 


end process 


D 


39 


23F9 


runp 


10 


run process 




15 


21 F5 


stopp 


11 


stop process 




IE 


21 FE 


Idpri 


1 


load current priority 





Table 4.17 IMS T414 error handling operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 
E 


13 
4D 


21 F3 
24FD 


csubO 
ccnti 


2 
3 


check subscript from 
check count from 1 


E 
E 


29 
10 


22F9 
21 FO 


testerr 
seterr 


2 

3 

1 


test error false and clear (no error) 
test error false and clear (error) 
set error 


E 


55 
57 


25F5 
25F7 


stoperr 
cirhalterr 


2 

1 


stop on error (no error) 
clear halt-on-error 


D 


58 


25F8 


sethalterr 


1 


set halt-on-error 




59 


25F9 


testhalterr 


2 


test halt-on-error 





Table 4.18 IMS T414 processor initialisation operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


2A 


22FA 


testpranal 


2 


test processor analysing 




3E 


23FE 


saveh 


4 


save high priority queue registers 




3D 


23FD 


savel 


4 


save low priority queue registers 




18 


21 F8 


sthf 




store high priority front pointer 




50 


25F0 


sthb 




store high priority back pointer 




10 


21 FC 


stif 




store low priority front pointer 




17 


21 F7 


stib 




store low priority back pointer 




54 


25F4 


sttimer 




store timer 
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5 System services 

System services include all the necessary logic to initialise and sustain operation of the device. They also 
include error handling and analysis facilities. 



5.1 



Power 



Power is supplied to the device via the VCC and GND pins. Several of each are provided to minimise 
inductance within the package. All supply pins must be connected. The supply must be decoupled close to 
the chip by at least one 100 nF low inductance (e.g. ceramic) capacitor between VCC and GND. Four layer 
boards are recommended; if two layer boards are used, extra care should be taken in decoupling. 

Input voltages must not exceed specification with respect to VCC and GND, even during power-up and power- 
down ramping, otherwise latchup can occur. CMOS devices can be permanently damaged by excessive 
periods of latchup. 



5.2 



CapPlus, CapMinus 



The internally derived power supply for internal clocks requires an external low leakage, low inductance VF 
capacitor to be connected between CapPlus and CapMinus. A ceramic capacitor is preferred, with an 
impedance less than 3 Ohms between 100 KHz and 20 MHz. If a polarised capacitor is used the negative 
terminal should be connected to CapMinus. Total RGB track length should be less than 50 mm. The 
connections must not touch power supplies or other noise sources. 









P.G.B. track 




VCC T 

r 


CapPlus 




1 






Decoupling 
4= capacitor 
VF 




Phase-locked 
loops 






1 






GND i- 


CapMinus 


P.C.B. track 









Figure 5.1 Recommended PLL decoupling 



5.3 



Clockln 



Transputer family components use a standard clock frequency, supplied by the user on the Clockln input. 
The nominal frequency of this clock for all transputer family components is 5 MHz, regardless of device type, 
transputer word length or processor cycle time. High frequency internal clocks are derived from Clockln, 
simplifying system design and avoiding problems of distributing high speed clocks externally. 

A number of transputer devices may be connected to a common clock, or may have individual clocks providing 
each one meets the specified stability criteria. In a multi-clock system the relative phasing of Clockln clocks 
is not important, due to the asynchronous nature of the links. Mark/space ratio is unimportant provided the 
specified limits of Clockln pulse widths are met. 

Oscillator stability is important. Clockln must be derived from a crystal oscillator; RG oscillators are not 
sufficiently stable. Clockln must not be distributed through a long chain of buffers. Clock edges must be 
monotonic and remain within the specified voltage and time limits. 
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Table 5.1 Input clock 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TDCLDCH 


Clockln pulse width low 


40 






ns 


1 


TDCHDCL 


Clockln pulse width high 


40 






ns 


1 


TDCLDCL 


Clockln period 




200 




ns 


1,2,4 


TDCerror 


Clockln timing error 






±0.5 


ns 


1.3 


TDC1 DC2 


Difference In Clockln for 2 linked devices 






400 


ppm 


1,4 


TDCr 


Clockln rise time 






10 


ns 


1,5 


TDCf 


Clockln fall time 






8 


ns 


1,5 



Notes 



1 These paramters are not tested. 

2 Measured between corresponding points on consecutive falling edges. 

3 Variation of individual falling edges from their nominal times. 

4 This value allows the use of 200ppm crystal oscillators for two devices connected together by a link. 

5 Clock transitions must be monotonic within the range VIH to VIL (table 10.3). 



1.5v 



TDCerror 

2.0vl 
0.8v- - - 



^^^s: 



: TDCerror 



TDCerror ^ 



TDCLDCH 




TDCHDCL 




eTDCf 



TDCLDCL 

90% 

10% 



' TDCerror 




Figure 5.2 Clockln timing 
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5.4 Reset 

Reset can go high with VCC, but must at no time exceed the maximum specified voltage for VIH. After VCC is 
valid Clockln should be running for a minimum period TDCVRL before the end of Reset. The falling edge of 
Reset initialises the transputer, triggers the memory configuration sequence and starts the bootstrap routine. 
Link outputs are forced low during reset; link inputs and EventReq should be held low. Memory request 
(DMA) must not occur whilst Reset is high but can occur before bootstrap (page 376). 

After the end of Reset there will be a delay of 144 periods of Clockln (figure 5.3). Following this, the 
MemWrDO, MemRf D1 and MemAD2-31 pins will be scanned to check for the existence of a pre-programmed 
memory Interface configuration (page 366). This lasts for a further 144 periods of Clockln. Regardless of 
whether a configuration was found, 36 configuration read cycles will then be performed on external memory 
using the default memory configuration (page 368), in an attempt to access the external configuration ROM. 
A delay will then occur, its period depending on the actual configuration. Finally eight complete and con- 
secutive refresh cycles will initialise any dynamic RAM, using the new memory configuration. If the memory 
configuration does not enable refresh of dynamic RAM the refresh cycles will be replaced by an equivalent 
delay with no external memory activity. 

If BootFromRom Is high bootstrapping will then take place immediately, using data from external memory; 
othenA/ise the transputer will await an input from any link. The processor will be in the low priority state. 



Reset L 

Action [- 



Delay coSraln conSraln Delay Refresh Boot 



Figure 5.3 IMS T414 post-reset sequence 



5.5 Bootstrap 



The transputer can be bootstrapped either from a link or from external ROM. To facilitate debugging, Boot- 
FromRom may be dynamically changed but must obey the specified timing restrictions. It is sampled once 
only by the transputer, before the first instruction is executed after Reset is taken low. 

If BootFromRom is connected high (e.g. to VCC) the transputer starts to execute code from the top two bytes 
in external memory, at address #7FFFFFFE. This location should contain a backward jump to a program in 
ROM. Following this access, BootFromRom may be taken low if required. The processor is in the low priority 
state, and the IV register points to MemStart (page 355). 

If BootFromRom is connected low (e.g. to GND) the transputer will wait for the first bootstrap message to 
arrive on any one of its links. The transputer is ready to receive the first byte on a link within two processor 
cycles TPCLPCL after Reset goes low. 

If the first byte received (the control byte) is greater than 1 it is taken as the quantity of bytes to be input. The 
following bytes, to that quantity, are then placed in internal memory starting at location MemStart. Following 
reception of the last byte the transputer will start executing code at MemStart as a low priority process. 
BootFromRom may be taken high after reception of the last byte, if required. The memory space immediately 
above the loaded code is used as work space. Messages arriving on other links after the control byte has 
been received and on the bootstrapping link after the last bootstrap byte will be retained until a process inputs 
from them. 
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Table 5.2 Reset and Analyse 



SYMBOL 


PARAfVIETER 


IVIIN 


NOIVI 


IVIAX 


UNITS 


NOTE 


TPVRH 


Power valid before Reset 


10 






ms 




TRHRL 


Reset pulse width high 


8 






Clockln 


1 


TDCVRL 


Clockln running before Reset end 


10 






ms 


2 


TAHRH 


Analyse setup before Reset 


3 






ms 




TRLAL 


Analyse hold after Reset end 


1 






Clockln 


1 


TBRVRL 


BootFromRom setup 









ms 




TRLBRX 


BootFromRom hold after Reset 









ms 


3 


TALBRX 


BootFromRom hold after Analyse 










3 



Notes 



1 Full periods of Clockln TDCLDCL required. 

2 At power-on reset. 

3 Must be stable until after end of bootstrap period. See Bootstrap section. 



Clockln ~\Z:\ \////////////////7\\///////7 ' 
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Figure 5.4 Transputer reset timing with Analyse low 
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Figure 5.5 Transputer reset and analyse timing 
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5.6 Peek and poke 

Any location in internal or external memory can be interrogated and altered when the transputer is waiting 
for a bootstrap from link. If the control byte is then eight more bytes are expected on the same link. The 
first four byte word is taken as an internal or external memory address at which to poke (write) the second 
four byte word. If the control byte is 1 the next four bytes are used as the address from which to peek (read) 
a word of data; the word is sent down the output channel of the same link. 

Following such a peek or poke, the transputer returns to its previously held state. Any number of accesses 
may be made in this way until the control byte is greater than 1 , when the transputer will commence reading 
its bootstrap program. Any link can be used, but addresses and data must be transmitted via the same link 
as the control byte. 

5.7 Analyse 

If Analyse is taken high when the transputer is running, the transputer will halt at the next descheduling point 
(page 344). From Analyse being asserted, the processor will halt within three time slice periods plus the 
time taken for any high priority process to complete. As much of the transputer status is maintained as is 
necessary to permit analysis of the halted machine. Processor flags Error and HaltOnError are not altered 
at reset, whether Analyse is asserted or not. Memory refresh continues. 

Input links will continue with outstanding transfers. Output links will not make another access to memory 
for data but will transmit only those bytes already in the link buffer. Providing there is no delay in link 
acknowledgement, the links should be inactive within a few microseconds of the transputer halting. 

Reset should not be asserted before the transputer has halted and link transfers have ceased. When Reset 
is taken low whilst Analyse is high, neither the memory configuration sequence nor the block of eight refresh 
cycles will occur; the previous memory configuration will be used for any external memory accesses. If 
BootFromRom is high the transputer will bootstrap as soon as Analyse is taken low, otherwise it will await a 
control byte on any link. If Analyse is taken low without Reset going high the transputer state and operation 
are undefined. After the end of a valid Analyse sequence the registers have the values given in table 5.3. 

Table 5.3 Register values after Analyse 



/ MemStart if bootstrapping from a link, or the external memory bootstrap address if 
bootstrapping from ROM. 

W MemStart if bootstrapping from ROM, or the address of the first free word after the 
bootstrap program if bootstrapping from link. 

A The value of /when the processor halted. 

B The value of IV when the processor halted, together with the priority of the process 
when the transputer was halted (i.e. the IV descriptor). 

C The ID of the bootstrapping link if bootstrapping from link. 
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5.8 



Error 



The Error pin is connected directly to the internal Error flag and follows the state of that flag. If Error is 
high it indicates an error in one of the processes caused, for example, by arithmetic overflow, divide by zero, 
array bounds violation or software setting the flag directly (page 344). Once set, the Error flag is only cleared 
by executing the instruction testerr. The error is not cleared by processor reset, in order that analysis can 
identify any errant transputer (page 353). 

A process can be programmed to stop if the Error flag is set; it cannot then transmit erroneous data to other 
processes, but processes which do not require that data can still be scheduled. Eventually all processes 
which rely, directly or indirectly, on data from the process in error will stop through lack of data. 

By setting the HaltOnErrorilag the transputer itself can be programmed to halt if Error becomes set. If Error 
becomes set after HaltOnErrorhas been set, all processes on that transputer will cease but will not necessarily 
cause other transputers in a network to halt. Setting HaltOn Error a^er Error will not cause the transputer to 
halt; this allows the processor reset and analyse facilities to function with the flags in indeterminate states. 

An alternative method of error handling is to have the errant process or transputer cause all transputers to 
halt. This can be done by applying the Error output signal of the errant transputer to the EventReq pin of a 
suitably programmed master transputer. Since the process state is preserved when stopped by an error, the 
master transputer can then use the analyse function to debug the fault. When using such a circuit, note that 
the Error flag is in an indeterminate state on power up; the circuit and software should be designed with this 
in mind. 



Error checks can be removed completely to optimise the performance of a proven program; any unexpected 
error then occurring will have an arbitrary undefined effect. 

If a high priority process pre-empts a low priority one, status of the Error and HaltOnError flags is saved 
for the duration of the high priority process and restored at the conclusion of it. Status of the Error flag is 
transmitted to the high priority process but the HaltOnError 1\aQ is cleared before the process starts. Either 
flag can be altered in the process without upsetting the error status of any complex operation being carried 
out by the pre-empted low priority process. 

In the event of a transputer halting because of HaltOnError, the links will finish outstanding transfers before 
shutting down. If Analyse is asserted then all inputs continue but outputs will not make another access to 
memory for data. Memory refresh will continue to take place. 

After halting due to the Error flag changing from to 1 whilst HaltOnError \s set, register /points two bytes 
past the instruction which set Error. After halting due to the Analyse pin being taken high, register / points 
one byte past the instruction being executed. In both cases /will be copied to register A. 
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Transputer 
2 
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Slave 
Transputer 

1 



Error 



Slave 

Transputer 

3 



Error[3] 



Figure 5.6 Error handling in a multi-transputer system 
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6 Memory 

The IMS T414 has 2 Kbytes of fast internal static memory for high rates of data throughput. Each internal 
memory access takes one processor cycle ProcClockOut (page 357). The transputer can also access 
4 Gbytes of external memory space. Internal and external memory are part of the same linear address 
space. Internal RAM can be disabled by holding DisablelntRAM high. All internal addresses are then 
mapped to external RAM. This pin should not be altered after Reset has been taken low. 

IMS T414 memory is byte addressed, with words aligned on four-byte boundaries. The least significant byte 
of a word is the lowest addressed byte. 

The bits in a byte are numbered to 7, with bit the least significant. The bytes are numbered from 0, with 
byte the least significant. In general, wherever a value is treated as a number of component values, the 
components are numbered in order of increasing numerical significance, with the least significant component 
numbered 0. Where values are stored in memory, the least significant component value is stored at the 
lowest (most negative) address. 

Internal memory starts at the most negative address #80000000 and extends to #800007FF. User memory 
begins at #80000048; this location is given the name MemStart. 

The reserved area of internal memory below MemStart is used to implement link and event channels. 

Two words of memory are reserved for timer use, TPtrLocO for high priority processes and TPtrLod for low 
priority processes. They either indicate the relevant priority timer is not in use or point to the first process on 
the timer queue at that priority level. 

Values of certain processor registers for the current low priority process are saved in the reserved IntSaveLoc 
locations when a high priority process pre-empts a low priority one. 

External memory space starts at #80000800 and extends up through #00000000 to #7FFFFFFF. Memory 
configuration data and ROM bootstrapping code must be in the most positive address space, starting at 
#7FFFFF6C and #7FFFFFFE respectively. Address space immediately below this is conventionally used for 
ROM based code. 
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hi Machine map lo 


Byte address 


Word offsets 


Occam map 






Reset Inst 


#7FFFFFFE 

#7FFFFFF8 

#7FFFFF60 

#0 

#80000800 - Start of external memory - #0200 

#80000048 MemStart MemStart #12 






Memory configuration 








EreglntSaveLoc 


#80000044 N 

#80000040 

#80000030 

#80000038 

#80000034 

#80000030 

#80000020 

#80000028 

#80000024 

#80000020 

#80000010 

#80000018 

#80000014 

#80000010 

#80000000 

#80000008 

#80000004 

#80000000 y 


>Note1 ^,3 

#07 
#06 
#05 
#04 
#03 
#02 
#01 
(Base of memory) #00 




STATUSIntSaveLoc 


OreglntSaveLoc 


BreglntSaveLoc 


AreglntSaveLoc 


IptrlntSaveLoc 


WdesclntSaveLoc 


TPtrLocI 


TPtrLocO 


Event 


Event 


Link 3 input 


Link 3 Input 


Link 2 Input 


Link 2 Input 


Link 1 Input 


Link 1 Input 


Link Input 


Link Input 


Link 3 Output 


Link 3 Output 


Link 2 Output 


Link 2 Output 


Link 1 Output 


Link 1 Output 


Link Output 


Link Output 















Notes 



Figure 6.1 IMS T414 memory map 



1 These locations are used as auxiliary processor registers and should not be manipulated by the user. Like 
processor registers, their contents may be useful for implementing debugging tools (Analyse, page 353). For 
details see Transputer Instruction Set - A Compiler Writers' Guide. 
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7 External memory interface 

The External Memory Interface (EMI) allows access to a 32 bit address space, supporting dynamic and static 
RAM as well as ROM and EPROM. EMI timing can be configured at Reset to cater for most memory types 
and speeds, and a program is supplied with the Transputer Development System to aid in this configuration. 

There are 13 internal configurations which can be selected by a single pin connection (page 366). If none 
are suitable the user can configure the interface to specific requirements, as shown in page 368. 

7.1 ProcClockOut 

This clock is derived from the internal processor clock, which is in turn derived from Clockln. Its period is 
equal to one internal microcode cycle time, and can be derived from the formula 

TPCLPCL = TDCLDCL / PLLx 

where TPCLPCL is the ProcClockOut Period, TDCLDCL is the Clockln Period and PLLx is the phase 
lock loop factor for the relevant speed part, obtained from the ordering details (Ordering section). 

The time value Tm is used to define the duration of Tstates and, hence, the length of external memory cycles; 
its value is exactly half the period of one ProcClockOut cycle (0.5*TPCLPCL), regardless of mark/space 
ratio of ProcClockOut. 

Edges of the various external memory strobes coincide with rising or falling edges of ProcClockOut. It should 
be noted, however, that there is a skew associated with each coincidence. The value of skew depends on 
whether coincidence occurs when the ProcClockOut edge and strobe edge are both rising, when both are 
falling or if either is rising when the other is falling. Timing values given in the strobe tables show the best 
and worst cases. If a more accurate timing relationship is required, the exact Tstate timing and strobe edge 
to ProcClockOut relationships should be calculated and the correct skew factors applied from the edge skew 
timing table 7.4. 

7.2 Tstates 

The external memory cycle is divided into six Tstates with the following functions: 
T1 Address setup time before address valid strobe. 
T2 Address hold time after address valid strobe. 
T3 Read cycle tristate or write cycle data setup. 
T4 Extendable data setup time. 
T5 Read or write data. 
T6 Data hold. 

Under normal conditions each Tstate may be from one to four periods Tm long, the duration being set during 
memory configuration. The default condition on Reset is that all Tstates are the maximum four periods Tm 
long to allow external initialisation cycles to read slow ROM. 

Period T4 can be extended indefinitely by adding externally generated wait states. 

An external memory cycle is always an even number of periods Tm in length and the start of T1 always 
coincides with a rising edge of ProcClockOut. If the total configured quantity of periods Tm is an odd 
number, one extra period Tm will be added at the end of T6 to force the start of the next T1 to coincide with 
a rising edge of ProcClockOut. This period is designated E in configuration diagrams (figure 7.11). 
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Table 7.1 ProcClockOut 



SYiViBOL 


PARAIVIETER 


IVIIN 


NOIVI 


IVIAX 


UNITS 


NOTE 


TPCLPCL 


ProcClockOut period 


a-1 


a 


a+1 


ns 


1 


TPCHPCL 


ProcClockOut pulse width high 


b-2.5 


b 


b+2.5 


ns 


2 


TPCLPCH 


ProcClockOut pulse width low 









ns 


3 


Tm 


ProcClockOut half cycle 


b-0.5 


b 


b+0.5 


ns 


2 


TPCstab 


ProcClockOut stability 






4 


% 


4 



Notes 



1 a is TDCLDCL/PLLx. 

2 b is 0.5*TPCLPCL (half the processor clock period). 

3 c is TPCLPCL-TPCHPCL. 

4 Stability is the variation of cycle periods between two consecutive cycles, measured at corresponding points on 
the cycles. 
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Figure 7.1 IMS T414 ProcClockOut timing 



7.3 



Internal access 



During an internal memory access cycle the external memory interface bus MemAD2-31 reflects the word 
address used to access internal RAM, MemnotWrDO reflects the read/write operation and MemnotRfDI is 
high; all control strobes are inactive. This is true unless and until a memory refresh cycle or DMA (memory 
request) activity takes place, when the bus will carry the appropriate external address or data. 

The bus activity is not adequate to trace the internal operation of the transputer in full, but may be used for 
hardware debugging in conjuction with peek and poke (page 353). 
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Figure 7.2 IMS T414 bus activity for internal memory cycle 
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7.4 MemAD2-31 

External memory addresses and data are multiplexed on one bus. Only the top 30 bits of address are 
output on the external memory interface, using pins MemAD2-31. They are normally output only during 
Tstates T1 and T2, and should be latched during this time. Byte addressing is carried out internally by the 
transputer for read cycles. For write cycles the relevant bytes in memory are addressed by the write strobes 
notMemWrBO-3. 

The data bus is 32 bits wide. It uses MemAD2-31 for the top 30 bits and MemnotRfDI and MemnotWrDO 
for the lower two bits. Read cycle data may be set up on the bus at any time after the start of 13, but must 
be valid when the transputer reads it at the end of 15. Data may be removed any time during T6, but must 
be off the bus no later than the end of that period. 

Write data is placed on the bus at the start of 13 and removed at the end of T6. If T6 is extended to force 
the next cycle Tmx (page 359) to start on a rising edge of ProcClockOut, data will be valid during this time 
also. 

7.5 MemnotWrDO 

During 11 and T2 this pin will be low if the cycle is a write cycle, otherwise it will be high. During Tstates T3 
to T6 it becomes bit of the data bus. In both cases it follows the general timing of MemAD2-31. 

7.6 MemnotRfDI 

During T1 and T2, this pin is low if the address on MemAD2-31 is a refresh address, othen/vise it is high. 
During Tstates T3 to T6 it becomes bit 1 of the data bus. In both cases it follows the general timing of 
MemAD2-31. 

7.7 notMemRd 

For a read cycle the read strobe notMemRd is low during T4 and T5. Data is read by the transputer on the 
rising edge of this strobe, and may be removed immediately aften/vard. If the strobe duration is insufficient it 
may be extended by adding extra periods Tm to either or both of the Tstates T4 and T5. Further extension 
may be obtained by inserting wait states at the end of T4. 

In the read cycle timing diagrams ProcClockOut is included as a guide only; it is shown with each Tstate 
configured to one period Tm. 

7.8 notMemSO-4 

To facilitate control of different types of memory and devices, the EMI is provided with five strobe outputs, 
four of which can be configured by the user. The strobes are conventionally assigned the functions shown in 
the read and write cycle diagrams, although there is no compulsion to retain these designations. 

notMemSO is a fixed format strobe. Its leading edge is always coincident with the start of T2 and its trailing 
edge always coincident with the end of T5. 

The leading edge of notMemSI is always coincident with the start of T2, but its duration may be configured 
to be from zero to 31 periods Tm. Regardless of the configured duration, the strobe will terminate no later 
than the end of T6. The strobe is sometimes programmed to extend beyond the normal end of Tmx. When 
wait states are inserted into an EMI cycle the end of Tmx is delayed, but the potential active duration of the 
strobe is not altered. Thus the strobe can be configured to terminate relatively early under certain conditions 
(page 374). If notMemSI is configured to be zero it will never go low. 
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notl\/IemS2, notiVlemSS and not!\/lemS4 are identical in operation. They all terminate at the end of T5, but 
the start of each can be delayed from one to 31 periods Tm beyond the start of T2. If the duration of one of 
these strobes would take it past the end of T5 it will stay high. This can be used to cause a strobe to become 
active only when wait states are inserted. If one of these strobes is configured to zero it will never go low. 
Figure 7.5 shows the effect of Wait on strobes in more detail; each division on the scale is one period Tm. 

Table 7.2 Read 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TaZdV 


Address tristate to data valid 









ns 




TdVRdH 


Data setup before read 


20 






ns 




TRdHdX 


Data hold after read 









ns 




TSOLRdL 


notMemSO before start of read 


a-2 


a 


a+2 


ns 


1 


TSOHRdH 


End of read from end of notMemSO 


-1 




1 


ns 




TRdLRdH 


Read period 


b 




b+6 


ns 


2 



Notes 



1 a is total of T2+T3 where T2, T3 can be from one to four periods Tm each in length. 

2 b is total of T4+Twait+T5 where T4, T5 can be from one to four periods Tm each in length and Twait may be 
any number of periods Tm in length. 
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Figure 7.3 IMS T414 external read cycle: static memory 
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Figure 7.4 IMS T414 external read cycle: dynamic memory 
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Table 7.3 IMS T414 strobe timing 



SYMBOL 


(n) 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TaVSOL 




Address setup before notMemSO 




a 




ns 


1 


TSOLaX 




Address hold after notMemSO 




b 




ns 


2 


TSOLSOH 




notMemSO pulse width low 


c 




C+6 


ns 


3 


TS0LS1L 


1 


notMemSI from notMemSO 







2 


ns 




TS0LS1H 


5 


notMemSI end from notMemSO 


d 




d+6 


ns 


4,6 


TS0HS1H 


9 


notMemSI end from notMemSO end 


e-1 




e+4 


ns 


5.6 


TS0LS2L 


2 


notMemS2 delayed after notMemSO 


M 




f+4 


ns 


7 


TS0LS2H 


6 


notMemS2 end from notMemSO 


c+4 




c+8 


ns 


3 


TS0HS2H 


10 


notMemS2 end from notMemSO end 







2 


ns 




TS0LS3L 


3 


notMemS3 delayed after notMemSO 


M 




f+3 


ns 


7 


TS0LS3H 


7 


notMemS3 end from notMemSO 


c+4 




C+8 


ns 


3 


TS0HS3H 


11 


notMemS3 end from notMemSO end 







2 


ns 




TS0LS4L 


4 


notMemS4 delayed after notMemSO 


f-1 




f+2 


ns 


7 


TS0LS4H 


8 


notMemS4 end from notMemSO 


c+4 




c+8 


ns 


3 


TS0HS4H 


12 


notMemS4 end from notMemSO end 







2 


ns 




Tmx 




Complete external memory cycle 




g 






8 



Notes 



1 a is T1 where T1 can be from one to four periods Tm in length. 

2 b is T2 where T2 can be from one to four periods Tm in length. 

3 c Is total of T2+T3+T4+Twait+T5 where T2, T3, T4, T5 can be from one to four periods Tnn each in length and 
Twait may be any number of periods Tm in length. 

4 d can be from zero to 31 periods Tm in length. 

5 e can be from -27 to +4 periods Tm in length. 

6 If the configuration would cause the strobe to remain active past the end of T6 it will go high at the end of T6. 
If the strobe is configured to zero periods Tm it will remain high throughout the complete cycle Tmx. 

7 f can be from zero to 31 periods Tm in length. If this length would cause the strobe to remain active past the 
end of T5 it will go high at the end of T5. If the strobe value is zero periods Tm it will remain low throughout 
the complete cycle Tmx. 

8 g is one complete external memory cycle comprising the total of T1+T2+T3+T4+Twalt+T5+T6 where T1, T2, 
T3, T4, T5 can be from one to four periods Tm each in length, T6 can be from one to five periods Tm in length 
and Twalt may be zero or any number of periods Tm in length. 



Tstate|Tl|T2|T3|T4|T5|T6|Tl| 
notMemSI | | 


Tstate|Tl|T2|T3|T4|W|W|T5|T6|Tl| 


notMemSI | 1 




notMemS2 

No wait states 


notMemS2 | 1 


Wait states inserted 



Figure 7.5 IMS T41 4 effect of wait states on strobes 
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Table 7.4 Strobe SO to ProcClockOut skew 



SYMBOL 



PARAMETER 



Strobe rising from ProcClockOut rising 
Strobe rising from ProcClockOut falling 
Strobe falling from ProcClockOut rising 
Strobe falling from ProcClockOut falling 



MIN 



NOM 



MAX UNITS NOTE 



TPCHSOH 
TPCLSOH 
TPCHSOL 
TPCLSOL 




1 

-3 
-1 



3 


ns 


4 


ns 





ns 


2 


ns 



ProcClockOut 



i 



i 



NotMemSO 



^ [-*■ TPCHSOH ^ |-^T 



-TPCHSOL 



rTPCLSOH 



^TPCLSOL 



Figure 7.6 IMS T414 skew of notMemSO to ProcClockOut 



7.9 



notMemWrBO-3 



Because the transputer uses word addressing, four write strobes are provided; one to write each byte of the 
word. If a particular byte is not to be written, then the corresponding data outputs are tristated. notMemWrBO 
addresses the least significant byte. 

The transputer has both early and late write cycle modes. For a late write cycle the relevant write strobes 
notMemWrBO-3 are low during T4 and T5; for an early write they are also low during T3. Data should be 
latched into memory on the rising edge of the strobes in both cases, although it is valid until the end of T6. 
If the strobe duration is insufficient, it may be extended at configuration time by adding extra periods Tm to 
either or both of Tstates T4 and T5 for both early and late modes. For an early cycle they may also be added 
to T3. Further extension may be obtained by inserting wait states at the end of T4. If the data hold time is 
insufficient, extra periods Tm may be added to T6 to extend it. 



Table 7.5 Write 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TdVWrH 


Data setup before write 


d 






ns 


1,5 


TWrHdX 


Data hold after write 


a 






ns 


1,2 


TSOLWrL 


notMemSO before start of early write 


b-3 




b+2 


ns 


1,3 




notMemSO before start of late write 


c-3 




c+2 


ns 


1,4 


TSOHWrH 


End of write from end of notMemSO 


-2 




2 


ns 


1 


TWrLWrH 


Early write pulse width 


d 




d+6 


ns 


1,5 




Late write pulse width 


e 




e+6 


ns 


1,6 



Notes 



1 Timing is for all write strobes notMemWrBO-3. 

2 a is T6 where T6 can be from one to five periods Tm in length. 

3 b is T2 where T2 can be from one to four periods Tm in length. 

4 c is total of T2+T3 where T2, T3 can be from one to four periods Tm each in length. 

5 d is total of T3+T4+Twait+T5 where T3, T4, T5 can be from one to four periods Tm each in length and Twait 
may be zero or any number of periods Tm in length. 

6 e is total of T4+Twait+T5 where T4, T5 can be from one to four periods Tm each in length and Twait may be 
zero or any number of periods Tm in length. 
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Figure 7.7 IMS T414 external write cycle 

In the write cycle timing diagrann ProcClockOut is included as a guide only; it is shown with each Tstate 
configured to one period Tm. The strobe is inactive during internal memory cycles. 
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Figure 7.8 IMS T414 dynamic RAM application 
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7.10 IViemConfig 

IVIemConfig is an input pin used to read configuration data when setting external memory interface (EMI) 
characteristics. It is read by the processor on two occasions after Reset goes low; first to check if one of the 
preset internal configurations is required, then to determine a possible external configuration. 

7.10.1 Internal configuration 

The internal configuration scan comprises 64 periods TDCLDCL of Cloclcln during the internal scan period 
of 144 Clockln periods. MemnotWrDO, MemnotRfDI and MemAD2-32 are all high at the beginning of the 
scan. Starting with MemnotWrDO, each of these lines goes low successively at intervals of two Clockln 
periods and stays low until the end of the scan. If one of these lines is connected to MemConfig the preset 
internal configuration mode associated with that line will be used as the EMI configuration. The default 
configuration is that defined in the table for MemADSI; connecting MemConfig to VCC will also produce 
this default configuration. Note that only 13 of the possible configurations are valid, all others remain at the 
default configuration. 

Table 7.6 IMS T414 internal configuration coding 





Duration of each Tstate 




Strobe 




Write 


Refresh 


Cycle 






periods Tm 






coefficient 




cycle 


interval 


time 
















Clockln 


Proc 


Pin 


11 


12 13 14 15 


T6 


Si 


s2 S3 


s4 


type 


cycles 


cycles 


MemnotWrDO 


1 


1111 


1 


30 


1 3 


5 


late 


72 


3 


MemnotRfDI 


1 


2 111 


2 


30 


1 2 


7 


late 


72 


4 


MemAD2 


1 


2 112 


3 


30 


1 2 


7 


late 


72 


5 


MemADS 


2 


3 112 


3 


30 


1 3 


8 


late 


72 


6 


MemAD4 


1 


1111 


1 


3 


1 2 


3 


early 


72 


3 


MemADS 


1 


12 12 


1 


5 


1 2 


3 


early 


72 


4 


MemADS 


2 


12 13 


1 


6 


1 2 


3 


early 


72 


5 


MemAD7 


2 


2 2 13 


2 


7 


1 3 


4 


early 


72 


6 


MemADS 


1 


1111 


1 


30 


1 2 


3 


early 


t 


3 


MemAD9 


1 


12 12 


1 


30 


2 5 


9 


early 


t 


4 


MemADIO 


2 


2 2 2 4 


2 


30 


2 3 


8 


late 


72 


7 


MemADII 


3 


3 3 3 3 


3 


30 


2 4 


13 


late 


72 


9 


MemADSI 


4 


4 4 4 4 


4 


31 


30 30 


18 


late 


72 


12 



t Provided for static RAM only. 
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Figure 7.9 IMS T414 internal configuration 
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Figure 7.10 If^S T414 internal configuration scan 

7.10.2 External configuration 

If MemConfig is held low until MemnotWrDO goes low the internal configuration is ignored and an external 
configuration will be loaded instead. An external configuration scan always follows an internal one, but if an 
internal configuration occurs any external configuration is ignored. 

The external configuration scan comprises 36 successive external read cycles, using the default EMI con- 
figuration preset by MemADSL However, instead of data being read on the data bus as for a normal read 
cycle, only a single bit of data is read on MemConfig at each cycle. Addresses put out on the bus for each 
read cycle are shown in table 7.7, and are designed to address ROM at the top of the memory map. The 
table shows the data to be held in ROM; data required at the MemConfig pin is the inverse of this. 

MemConfig is typically connected via an inverter to MemnotWrDO. Data bit zero of the least significant byte 
of each ROM word then provides the configuration data stream. By switching MemConfig between various 
data bus lines up to 32 configurations can be stored in ROM, one per bit of the data bus. MemConfig can be 
permanently connected to a data line or to GND. Connecting MemConfig to GND gives all Tstates configured 
to four periods; notMemSI pulse of maximum duration; notMemS2-4 delayed by maximum; refresh interval 
72 periods of Clockln; refresh enabled; late write. 

The external memory configuration table 7.7 shows the contribution of each memory address to the 13 con- 
figuration fields. The lowest 12 words (#7FFFFF6G to #7FFFFF98, fields 1 to 6) define the number of extra 
periods Tm to be added to each Tstate. If field 2 is 3 then three extra periods will be added to T2 to extend 
it to the maximum of four periods. 

The next five addresses (field 7) define the duration of notMemSI and the following fifteen (fields 8 to 10) 
define the delays before strobes notMemS2-4 become active. The five bits allocated to each strobe allow 
durations of from to 31 periods Tm, as described in strobes page 359. 

Addresses #7FFFFFEC to #7FFFFFF4 (fields 1 1 and 12) define the refresh interval and whether refresh is to 
be used, whilst the final address (field 13) supplies a high bit to MemConfig if a late write cycle is required. 



The columns to the right of the coding table show the values of each configuration bit for the four sample 
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external configuration diagrams. Note the inclusion of period E at the end of 16 in some diagrams. This is 
inserted to bring the start of the next Tstate T1 to coincide with a rising edge of ProcClockOut (page 357). 

Wait states W have been added to show the effect of them on strobe timing; they are not part of a configuration. 
In each case which includes wait states, two wait periods are defined. This shows that if a wait state would 
cause the start of T5 to coincide with a falling edge of ProcClockOut, another period Tm is generated by 
the EMI to force it to coincide with a rising edge of ProcClockOut. This coincidence is only necessary if wait 
states are added, otherwise coincidence with a falling edge is permitted. Any configuration memory access 
is only permitted to be extended using wait, up to a total of 14 Clockln periods. 
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Figure 7.11 IMS T414 external configuration 
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® IVIemConfig connected to inverse of MemnotWrDO 

© Configuration field 1 ; T1 configured for 2 periods Tm 

© Configuration field 2; T2 configured for 3 periods Tm 

® Configuration field 10; most significant bit of notMemS4 configured high 

® Configuration field 1 1 ; refresh interval configured for 36 periods Clockln 

© Configuration field 12; refresh enabled 

@ Configuration field 13; early write cycle 



Figure 7.12 IMS T414 external configuration scan 
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Table 7.7 IMS T414 external configuration coding 



Scan 
cycle 


MemAD 
address 


Field 


Function 


Example diagram | 


1 


2 


3 


4 


1 


7FFFFF6C 


1 - 


T1 least significant bit 














2 


7FFFFF70 


1 


T1 most significant bit 














3 


7FFFFF74 


2 


T2 least significant bit 


1 








1 


4 


7FFFFF78 


2 


T2 most significant bit 














5 


7FFFFF7C 


3 


T3 least significant bit 


1 


1 


1 


1 


6 


7FFFFF80 


3 


T3 most significant bit 














7 


7FFFFF84 


4 


T4 least significant bit 














8 


7FFFFF88 


4 


T4 most significant bit 














9 


7FFFFF8C 


5 


T5 least significant bit 














10 


7FFFFF90 


5 


T5 most significant bit 














11 


7FFFFF94 


6 


T6 least significant bit 


1 





1 


1 


12 


7FFFFF98 


6 


T6 most significant bit 














13 


7FFFFF9C 


7 


notMemSI least significant bit 








1 


1 


14 


7FFFFFA0 


7 
















15 


7FFFFFA4 


7 


^ * 














16 


7FFFFFA8 


7 




1 











17 


7FFFFFAC 


7 


notMemSI most significant bit 














18 


7FFFFFB0 


8 


notMemS2 least significant bit 


1 








1 


19 


7FFFFFB4 


8 




1 


1 





1 


20 


7FFFFFB8 


8 


* ^ 











1 


21 


7FFFFFBC 


8 
















22 


7FFFFFC0 


8 


notMemS2 most significant bit 














23 


7FFFFFC4 


9 


notMemS3 least significant bit 


1 


1 


1 


1 


24 


7FFFFFC8 


9 







1 








25 


7FFFFFCC 


9 


* ^ 





1 





1 


26 


7FFFFFD0 


9 










1 





27 


7FFFFFD4 


9 


notMemS3 most significant bit 














28 


7FFFFFD8 


10 


notMemS4 least significant bit 











1 


29 


7FFFFFDC 


10 







1 


1 


1 


30 


7FFFFFE0 


10 


* ^ 


1 


1 








31 


7FFFFFE4 


10 
















32 


7FFFFFE8 


10 


notMemS4 most significant bit 














33 


7FFFFFEC 


11 


Refresh Interval least significant bit 


- 


- 


- 


- 


34 


7FFFFFF0 


11 


Refresh Interval most significant bit 


- 


- 


- 


- 


35 


7FFFFFF4 


12 


Refresh Enable 


- 


- 


- 


- 


36 


7FFFFFF8 


13 


Late Write 





1 


1 
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Table 7.8 IMS T414 memory refresh cxDnfiguration coding 



Refresh 


Interval 


Field 11 


Complete 


interval 


in /iS 


encoding 


cycle (mS) 


18 


3.6 


00 


0.922 


36 


7.2 


01 


1.843 


54 


10.8 


10 


2.765 


72 


14.4 


11 


3.686 



Refresh intervals are in periods of Clockln and Clockln frequency is 5 MHz: 

Interval = 18 * 200 = 3600 ns 

Refresh interval is between successive incremental refresh addresses. 
Complete cycles are shown for 256 row DRAMS. 

Table 7.9 Memory configuration 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TMCVRdH 
TRdHMCX 
TSOLRdH 


Memory configuration data setup 
Memory configuration data hold 
notMemSO to configuration data read 


30 

a 




a+6 


ns 
ns 
ns 


1 



Notes 



1 a is 16 periods Tm. 



Tstate I T1 , T2 | T3 | T4 , T5 | T6 
_____ I I I I I I I I I I I I I I I I I ,,, I ,, , 
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Figure 7.13 IMS T414 external configuration read cycle timing 
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7.11 



notMemRf 



The IMS T414 can be operated with memory refresh enabled or disabled. The selection is made during 
memory configuration, when the refresh interval is also determined. Refresh cycles do not interrupt internal 
memory accesses, although the internal addresses cannot be reflected on the external bus during refresh. 

When refresh is disabled no refresh cycles occur. During the post-Reset period eight dummy refresh cycles 
will occur with the appropriate timing but with no bus or strobe activity. 

A refresh cycle uses the same basic external memory timing as a normal external memory cycle, except that 
it starts two periods Tm before the start of T1. If a refresh cycle is due during an external memory access, 
it will be delayed until the end of that external cycle. Two extra periods Tm (periods R in the diagram) will 
then be inserted between the end of T6 of the external memory cycle and the start of T1 of the refresh cycle 
itself. The refresh address and various external strobes become active approximately one period Tm before 
T1. Bus signals are active until the end of T2, whilst notMemRf remains active until the end of T6. 

For a refresh cycle, MemnotRfDI goes low before notMemRf goes low and MemnotWrDO goes high with 
the same timing as MemnotRfDI. All the address lines share the same timing, but only MemAD2-ll give 
the refresh address. MemAD12-30 stay high during the address period, whilst MemAD31 remains low. 
Refresh cycles generate strobes notMemSO-4 with timing as for a normal external cycle, but notMemRd and 
notMemWrBO-3 remain high. MemWait operates normally during refresh cycles. 

Table 7.10 Memory refresh 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TRfLRfH 

TRaVSOL 

TRfLSOL 


Refresh pulse width low 

Refresh address setup before notMemSO 

Refresh indicator setup before notMemSO 


a 
T1-5 
b-5 




a+6 
b+5 


ns 
ns 
ns 


1 
2 



Notes 



1 a is total Tmx+Tm. 

2 b is total Tl+Tm where T1 can be from one to four periods Tnfi in length. 
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Figure 7.14 IMS T414 refresh cycle timing 
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IVIemWait 



Taking IVIemWait high with the timing shown will extend the duration of 14. MemWait is sampled relative 
to the falling edge of ProcCloclcOut during a T3 period, and should not change state in this region. By 
convention, notMemS4 is used to synchronize wait state insertion. If this or another strobe is used, its delay 
should be such as to take the strobe low an even number of periods Tm after the start of 11, to coincide with 
a rising edge of ProcClocl<Out. 

IVIemWait may be kept high indefinitely, although if dynamic memory refresh is used it should not be kept 
high long enough to interfere with refresh timing. MemWait operates normally during all cycles, including 
refresh and configuration cycles. It does not affect internal memory access in any way. 

If the start of T5 would coincide with a falling edge of ProcClockOut an extra wait period Tm (EW) is 
generated by the EMI to force coincidence with a rising edge. Rising edge coincidence is only forced if wait 
states are added, othen/vise coincidence with a falling edge is permitted. 
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Table 7.1 1 Memory wait 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPCHWtH 
TPCHWtL 
TWtLWtH 


Wait setup 

Wait iiold 

Delay before re-assertion of Wait 


0.5Tm+3 

0.5Tm+3 

2Tm 






ns 
ns 
ns 


1,2 
1,2 



Notes 



1 ProcClockOut load should not exceed 50pf. 

2 If wait period exceeds refresh interval, refresh cycles will be lost. 
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Figure 7.15 IMS T414 memory wait timing 
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7.13 MemReq, IVlemGranted 

Direct memory access (DMA) can be requested at any time by taking tiie asynchronous IVIemReq input high. 
The transputer samples MemReq during the final period Tm of T6 of both refresh and external memory 
cycles. To guarantee taking over the bus immediately following either, MemReq must be set up at least two 
periods Tm before the end of T6. In the absence of an external memory cycle, MemReq is sampled during 
every low period of ProcClockOut. The address bus is tristated two periods Tm after the ProcCIockOut 
rising edge which follows the sample. MemGranted is asserted one period Tm after that. 

Removal of MemReq is sampled during each low period of ProcCIockOut and MemGranted is removed 
synchronously with the next falling edge of ProcCIockOut. If accurate timing of DMA is required, MemReq 
should be set low coincident with a falling edge of ProcCIockOut. Further external bus activity, either refresh, 
external cycles or reflection of internal cycles, will commence at the next rising edge of ProcCIockOut. 

Strobes are left in their inactive states during DMA. DMA cannot interrupt a refresh or external memory cycle, 
and outstanding refresh cycles will occur before the bus is released to DMA. DMA does not interfere with 
internal memory cycles in any way, although a program running in internal memory would have to wait for 
the end of DMA before accessing external memory. DMA cannot access internal memory. If DMA extends 
longer than one refresh interval (Memory Refresh Configuration Coding, table 7.8), the DMA user becomes 
responsible for refresh. DMA may also inhibit an internally running program from accessing external memory. 

DMA allows a bootstrap program to be loaded into external RAM ready for execution after reset. If MemReq is 
held high throughout reset, MemGranted will be asserted before the bootstrap sequence begins. MemReq 
must be high at least one period TDCLDCL of Clockln before Reset. The circuit should be designed to 
ensure correct operation if Reset could interrupt a normal DMA cycle. 

Table 7.12 Memory request 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TMRHMGH 
TMRLMGL 
TADZMGH 
TMGLADV 


Memory request response time 
Memory request end response time 
Bus tristate before memory granted 
Bus active after end of memory granted 


4 
2 


1 
1 


7 
5 


Tm 
Tm 
Tm 
Tm 


1 



Notes 



1 These values assume no external memory cycle is in progress. If an external cycle is active, maximum time 
could be (1 EMI cycle Tmx)+(1 refresh cycle TRfLRfH)+(6 periods Tm). 
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Figure 7.16 IMS T414 memory request timing 
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Figure 7.17 IMS T414 DMA sequence at reset 
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Figure 7.18 IMS T414 operation of MemReq, MemGranted with external, refresh memory cycles 
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Figure 7.19 IMS T414 operation of MemReq, MemGranted with external, internal memory cycles 
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Events 



EventReq and EventAck provide an asynchronous handshake interface between an external event and an 
internal process. When an external event takes EventReq high the external event channel (additional to the 
external link channels) is made ready to communicate with a process. When both the event channel and the 
process are ready the processor takes EventAck high and the process, if waiting, is scheduled. EventAck 
is removed after EventReq goes low. 

Only one process may use the event channel at any given time. If no process requires an event to occur 
EventAck will never be taken high. Although EventReq triggers the channel on a transition from low to high, 
it must not be removed before EventAck is high. EventReq should be low during Reset; if not it will be 
ignored until it has gone low and returned high. EventAck is taken low when Reset occurs. 

If the process is a high priority one and no other high priority process is running, the latency is as described 
on page 341 . Setting a high priority task to wait for an event input allows the user to interrupt a transputer 
program running at low priority. The time taken from asserting EventReq to the execution of the microcode 
interrupt handler in the CPU is four cycles. The following functions take place during the four cycles: 

Cycle 1 Sample EventReq at pad on the rising edge of ProcClockOut and synchronise. 

Cycle 2 Edge detect the synchronised EventReq and form the interrupt request. 

Cycle 3 Sample interrupt vector for microcode ROM in the CPU. 

Cycle 4 Execute the interrupt routine for Event rather than the next instruction. 

Table 8.1 Event 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TVHKH 
TKHVL 
TVLKL 
TKLVH 


Event request response 

Event request hold 

Delay before removal of event acknowledge 

Delay before re-assertion of event request 










a 


ns 
ns 
ns 
ns 


1 



Notes 



1 a is 3 processor cycles TPCLPCL. 
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Figure 8.1 IMS T41 4 event timing 
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9 Links 

Four identical INMOS bi-directional serial links provide synchronized communication between processors 
and with the outside world. Each link comprises an input channel and output channel. A link between two 
transputers is implemented by connecting a link interface on one transputer to a link interface on the other 
transputer. Every byte of data sent on a link is acknowledged on the input of the same link, thus each signal 
line carries both data and control information. 

The quiescent state of a link output is low. Each data byte is transmitted as a high start bit followed by a one 
bit followed by eight data bits followed by a low stop bit. The least significant bit of data is transmitted first. 
After transmitting a data byte the sender waits for the acknowledge, which consists of a high start bit followed 
by a zero bit. The acknowledge signifies both that a process was able to receive the acknowledged data byte 
and that the receiving link is able to receive another byte. The sending link reschedules the sending process 
only after the acknowledge for the final byte of the message has been received. 

The IMS T414 links support the standard INMOS communication speed of 10 Mbits/sec. In addition they can 
be used at 5 or 20 Mbits/sec. Links are not synchronised with Clockln or ProcClockOut and are insensitive 
to their phases. Thus links from independently clocked systems may communicate, providing only that the 
clocks are nominally identical and within specification. 

Links are TTL compatible and intended to be used in electrically quiet environments, between devices on a 
single printed circuit board or between two boards via a backplane. Direct connection may be made between 
devices separated by a distance of less than 300 millimetres. For longer distances a matched 100 ohm 
transmission line should be used with series matching resistors RM. When this is done the line delay should 
be less than 0.4 bit time to ensure that the reflection returns before the next data bit is sent. 

Buffers may be used for very long transmissions. If so, their overall propagation delay should be stable within 
the skew tolerance of the link, although the absolute value of the delay is immaterial. 

Link speeds can be set by LinkSpecial, LinkOSpecial and Link123Special. The link speed can be 
set independently. Table 9.1 shows uni-directional and bi-directional data rates in Kbytes/sec for each link 
speed; LinknSpecial is to be read as LinkOSpecial when selecting link speed and as Link123Special for 
the others. Data rates are quoted for a transputer using internal memory, and will be affected by a factor 
depending on the number of external memory accesses and the length of the external memory cycle. 

Table 9.1 Speed Settings for Transputer Links 



Link 
Special 


Linkn 
Special 


Mbits/sec 


Kbytes/sec 


Uni 


Bi 





1 
1 



1 


1 


10 
5 
10 
20 


400 
200 
400 
800 


800 
400 
800 
1600 





H L, 
1 Ack 1 


H'H 01234567 L, 


1 Data 1 



Figure 9.1 IMS T414 link data and acknowledge packets 
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Table 9.2 Link 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TJQr 


LinkOut rise time 






20 


ns 


1 


TJQf 


LinkOut fall time 






10 


ns 


1 


TJDr 


Linkin rise time 






20 


ns 


1 


TJDf 


Linkin fall time 






20 


ns 


1 


TJQJD 


Buffered edge delay 









ns 




TJBskew 


Variation in TJQJD 20 Mbits/s 






3 


ns 


2 




10Mbits/s 






10 


ns 


2 




5 Mbits/s 






30 


ns 


2 


CLIZ 


Linkin capacitance @ f=l MHz 






7 


PF 


1 


CLL 


LinkOut load capacitance 






50 


PF 




RM 


Series resistor for 1 0Ofi transmission line 




56 




ohms 





Notes 



1 These paramters are sampled, but are not 100% tested. 

2 This is the variation in the total delay through buffers, transmission lines, differential receivers etc., caused by 
such things as short term variation in supply voltages and differences in delays for rising and falling edges. 




Figure 9.2 IMS T414 link timing 



1 


LinkOut 1.5V- -^ 


/ 


Latest TJQJD -*- 
Earliest TJQJD *- 




-* 


"*■ 


Linkin 1 .5V -^ 


k- 
b 





TJBskew -^ 





Figure 9.3 IMS T414 buffered link timing 
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Figure 9.4 IMS T414 Links directly connected 
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Figure 9.5 IMS T414 Links connected by transmission line 
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Figure 9.6 IMS T414 Links connected by buffers 



382 
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10.1 DC electrical characteristics 



Table 10.1 Absolute maximum ratings 



SYMBOL 


TARAfVIETER 


IVIIN 


MAX 


UNITS 


NOTE 


VCC 


DC supply voltage 





7.0 


V 


1,2,3 


VI, vo 


Voltage on input and output pins 


-0.5 


VCC+0.5 


V 


1,2,3 


II 


Input current 




±25 


mA 


4 


OSCT 


Output short circuit time (one pin) 




1 


s 


2 


TS 


Storage temperature 


-65 


150 


°C 


2 


TA 


Ambient temperature under bias 


-55 


125 


°C 


2 


PDmax 


Maximum allowable dissipation 




2 


W 





Notes 



All voltages are with respect to GND. 



2 This is a stress rating only and functional operation of the device at these or any other conditions beyond those 
indicated in the operating sections of this specification is not implied. Stresses greater than those listed may 
cause permanent damage to the device. Exposure to absolute maximum rating conditions for extended periods 
may affect reliability. 

3 This device contains circuitry to protect the inputs against damage caused by high static voltages or electrical 
fields. However, it is advised that normal precautions be taken to avoid application of any voltage higher than the 
absolute maximum rated voltages to this high impedance circuit. Unused inputs should be tied to an appropriate 
logic level such as VCC or GND. 

4 The input current applies to any input or output pin and applies when the voltage on the pin is between GND 
and VCC. 



Table 10.2 Operating conditions 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VCC 

VI, vo 

CL 
TA 


DC supply voltage 
Input or output voltage 
Load capacitance on any pin 
Operating temperature range 


4.75 





5.25 

VCC 

60 

70 


V 
V 
PF 
°C 


1 
1,2 

3 



Notes 



1 All voltages are with respect to GND. 

2 Excursions beyond the supplies are permitted but not recommended; see DC characteristics. 

3 Air flow rate 400 linear ft/min transverse air flow. 
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Table 10.3 DC characteristics 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VIH 


High level input voltage 


2.0 


VGG+0.5 


V 


1,2 


VIL 


Low level input voltage 


-0.5 


0.8 


V 


1,2 


II 


Input current @ GND<VI<VCC 




±10 


mA 


1,2.7 








±50 


/.A 


1,2,8 


VOH 


Output high voltage @ IOH=2mA 


VGG-1 




V 


1,2 


VOL 


Output low voltage @ IOL=4mA 




0.4 


V 


1,2 


lOS 


Output short circuit current @ GND<VO<VCG 


36 


65 


mA 


1,2,3.6 






65 


100 


mA 


1,2,4,6 


lOZ 


Tristate output current @ GND<VO<VGC 




±10 


/xA 


1,2 


PD 


Power dissipation 




900 


mW 


2,5 


GIN 


Input capacitance @ f=1 MHz 




7 


PF 


6 


COZ 


Output capacitance @ f=1 MHz 




10 


PF 


6 



Notes 



1 Ail voltages are with respect to GND. 

2 Parameters for IMS T414-S measured at 4.75V<VCC<5.25V and 0°C<TA<70°C. 
Input clock frequency = 5 MHz. 

3 Current sourced from non-link outputs. 

4 Current sourced from link outputs. 

5 Power dissipation varies with output loading and program execution. 

6 This parameter is sampled and not 100% tested. 

7 For inputs other than those in Note 8. 

8 For MemReq, MemWait, MemConfig, Analyse, Reset, Clockln, EventReq, LinklnO-3, 
LinkSpecial, LInkOSpecial, Link123Special, BootFromRom, HoldToGND. 



10.2 Equivalent circuits 



VCC 



R1 



Output 



50pF 
GND - 



R2 



Load for: 


R1 


R2 


Equivalent load: 


Link outputs 
Other outputs 


1K96 
970R 


47K 
24K 


1 Schottky TTL input 

2 Schottky TTL inputs 



Diodes are 1N916 



Figure 10.1 Load circuit for AC measurements 
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Vdd-1 " 
Inputs 

OV 

Vdd-1 
Inputs 

OV 

Vdd ■ 
Outputs 

OV 

Vdd 
Outputs 

OV 




\ 




\ 

VIH 

VIL 
/ 




/ 

tpHL 




"^ 


\l.5V 




tpLH 






^ 


7M.5V 







Figure 10.2 AC measurements timing waveforms 



Test point 
Output under test 



X" 



510R J f^ 



vcc 



GND ■ L- 



Figure 10.3 Tristate load circuit for AC measurements 
10.3 AC timing characteristics 

Table 10.4 Input, output edges 



SYIVIBOL 


PARAIVIETER 


IVIIN 


IVIAX 


UNITS 


NOTE 


TDr 


Input rising edges 


2 


20 


ns 


1,2 


TDf 


Input falling edges 


2 


20 


ns 


1,2 


TQr 


Output rising edges 




25 


ns 


1 


TQf 


Output falling edges 




15 


ns 


1 


TSOLaHZ 


Address high to tristate 


a 


a+6 


ns 


3 


TSOLaLZ 


Address low to tristate 


a 


a+6 


ns 


3 



Notes 



1 Non-link pins; see section on links. 

2 All inputs except Clockln; see section on Clockln. 



3 a is T2 where T2 can be from one to four periods Tm in length. 
Address lines include MennnotWrDO, MemnotRf D1 , MemAD2-31 . 
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Figure 10.4 IMS T414 input and output edge timing 
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Figure 10.5 IMS T414 tristate timing relative to notMemSO 
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Notes 



Figure 10.6 Typical rise/fall times 



1 Skew is measured between notMemSO with a standard load (2 Schottky TTL Inputs and SOpF) and 
notMemSO with a load of 2 Schottky TTL inputs and varying capacitance. 
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10.4 Power rating 

Internal power dissipation Pint of transputer and peripheral chips depends on VCC, as shown in figure 10.7. 
Pint is substantially independent of temperature. 

Total power dissipation P^ of the chip is 

Pd = Pint + Pio 
where Pjo is the power dissipation in the input and output pins; this is application dependent. 
Internal working temperature Tj of the chip is 

Tj^TA+ejA*PD 

where Ta is the external ambient temperature in °C and BJa is the junction-to-ambient thermal resistance in 
°C/W. BJa for each package is given in the Packaging Specifications section. 



600 i 

Power 500 
PINT 

"^^ 400-| 

300- 




— ^1 — , — , — , — , — ^ — , — , — , — , — , — , — ^ — P 

4.4 4.6 4.8 5.0 5.2 5.4 5.6 
VCC Volts 



Figure 10.7 IMS T414 internal power dissipation vs VCC 
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Performance 



The performance of the transputer is measured in terms of the number of bytes required for the program, and 
the number of (internal) processor cycles required to execute the program. The figures here relate to occann 
programs. For the same function, other languages should achieve approximately the same performance as 
Occam. 



11.1 



Performance overview 



These figures are averages obtained from detailed simulation, and should be used only as an initial guide; 
they assume operands are of type INT. The abbreviations in table 1 1 .1 are used to represent the quantities 
indicated. In the replicator section of the table, figures in braces {} are not necessary if the number of 
replications is a compile time constant. To estimate performance, add together the time for the variable 
references and the time for the operation. 



Table 1 1 .1 Key to performance table 



np 

ne 

r 

ts 

w 

P 

Eg 

Et 

Tb 

Tbp 

Tbc 

nsp 

nap 



number of component processes 

number of processes earlier in queue 

1 if INT parameter or array parameter, if not 

number of table entries (table size) 

width of constant in nibbles 

number of places to shift 

expression used in a guard 

timer expression used in a guard 

most significant bit set of multiplier ((-1) if the multiplier is 0) 

most significant bit set in a positive multiplier when counting from zero ((-1) if the multiplier is 0) 

most significant bit set in the two's complement of a negative multiplier 

Number of scalar parameters in a procedure 

Number of array parameters in a procedure 
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Table 1 1 .2 Performance 





Size (bytes) 


Time (cycles) 


Names 

variables 

in expression 
assigned to or Input to 
in PROC or FUNCTION call, 
corresponding to an INT parameter 

channels 


1.1+r 
1.1+r 

1.1+r 
1.1 


2.1+2(r) 
1.1 +(r) 

1.1 +(r) 
2.1 


Array Variables (for single dimension arrays) 
constant subscript 
variable subscript 
expression subscript 




5.3 
5.3 




7.3 
7.3 


Declarations 

CHAN OF protocol 

[size] CHAN OF protocol 
PROC 


3.1 

9.4 

body+2 


3.1 

2.2 + 20.2*size 




Primitives 

assignment 

input 

output 

STOP 

SKIP 



4 
1 
2 





26.5 

26 

25 




Aritlimetic operators 
+ 

* 

/ 

REM 

» « 


1 
2 
2 
2 
2 


1 

39 
40 
38 
3+p 


l\/lodu[o Aritlimetic operators 
PLUS 
MINUS 
TIMES (fast multiply) 


2 

1 
1 


2 

1 

4+Tb 


Boolean operators 

OR 

AND NOT 


4 

1 


8 
2 


Comparison operators 
= constant 
= variable 
<> constant 
<> variable 
> < 
>= <= 



2 
1 
3 
1 
2 


1 
3 
3 
5 
2 
4 


Bit operators 

/\ \/ X - 


2 


2 


Expressions 

constant in expression 
check if error 


w 

4 


w 

6 



1 1 Performance 



389 



Table 1 1 .3 Performance 





Size (bytes) 


Time (cycles) 


Timers 






timer input 


2 


3 


timer AFTER 






if past time 


2 


4 


witii empty timer queue 


2 


31 


non-empty timer queue 


2 


38+ne*9 


ALT (timer) 






witii empty timer queue 


6 


52 


non-empty timer queue 


6 


59+ne*9 


timer alt guard 


8+2Eg+2Et 


34+2Eg+2Et 


Constructs 






SEQ 








IF 


1.3 


1.4 


if guard 


3 


4.3 


ALT (non timer) 


6 


26 


alt channel guard 


10.2+2Eg 


20+2Eg 


skip alt guard 


8+2Eg 


10+2Eg 


PAR 


11.5+(np-1)*7.5 


19.5+(np-1)*30.5 


WHILE 


4 


12 


Procedure or function call 








3.5+(nsp-2)*1.1 


16.5+(nsp-2)*1.1 




+nap*2.3 


+nap*2.3 


Replicators 






replicated SEQ 


7.3{+5.1} 


(-3.8)+15.1*count{+7.1} 


replicated IF 


12.3{+5.1} 


(-2.6)+19.4*count{+7.1} 


replicated ALT 


24.8{+10.2} 


25.4+33.4*count{+14.2} 


replicated timer ALT 


24.8{+10.2} 


62.4+33.4*count{+14.2} 


replicated PAR 


39.1 {+5.1} 


(-6.4)+70.9*count{+7.1} 



11.2 Fast multiply, TIMES 

The IMS T414 has a fast integer multiplication instruction product. The time taken for a fast multiply is 4+Tb. 
The time taken for a multiplication by zero is 3 cycles. For example, if the multiplier is 1 the time taken is 
4 cycles, if the multiplier is -1 (all bits set) the time taken is 35 cycles. 

Implementations of high level languages on the transputer may take advantage of this instruction. For example, 
the Occam modulo arithmetic operator TIMES is implemented by the instruction and the right-hand operand is 
treated as the multiplier. The fast multiplication instruction is also used in high level language implementations 
for the multiplication implicit in multi-dimensional array access. 



11.3 



Arithmetic 



A set of functions are provided within the development system to support the efficient implementation of 
multiple length integer arithmetic and floating point arithmetic. In table 11.4 n gives the number of places 
shifted and all arguments and results are assumed to be local. Full details of these functions are provided 
in the OCCam reference manual, supplied as part of the development system and available as a separate 
publication. 

When calculating the execution time of the predefined maths functions, no time needs to be added for calling 
overhead. These functions are compiled directly into special purpose instructions which are designed to 
support the efficient implementation of multiple length integer arithmetic and floating point arithmetic. 
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Table 1 1 .4 Arithmetic performance 









+ cycles for 


Function 




Cycles 


parameter access f 


LONGADD 




2 


7 


LONGSUM 




3 


8 


LONGSUB 




2 


7 


LONGDIFF 




3 


8 


LONGPROD 




34 


8 


LONGDIV 




36 


8 


SHIFTRIGHT 


(n<32) 


4+n 


8 




(n>=32) 


n-27 




SHIFTLEFT 


(n<32) 


4+n 


8 




(n>=:32) 


n-27 




NORMALISE 


(n<32) 


n+6 


7 




(n>=32) 


n-25 






(n=64) 


4 




ASHIFTRIGHT 




SHIFTRIGHT+2 


5 


ASHIFTLEFT 




SHIFTLEFT+4 


5 


ROTATERIGHT 




SHIFTRIGHT 


7 


ROTATELEFT 




SHIFTLEFT 


7 


FRACMOTL 




LONGPROD+4 


5 



t Assuming local variables. 



11.4 Floating point operations 



Floating point operations for the IMS T414 are provided by a run-time package. This requires approximately 
400 bytes of memory for the single length arithmetic operations, and 2500 bytes for the double length arithmetic 
operations. Table 1 1 .5 summarizes the estimated performance of the package. 



Table 11.5 IMS T414 floating point operations performance 





Processor cycles 


IMS T414 


Typical 


Worst 


REAL32 + - 
* 

/ 

<> = >=<=<> 

REAL64 + - 
* 

/ 


230 
200 
245 
60 

565 

760 

1115 

60 


300 
240 
280 
60 

700 

940 

1420 

60 



1 1 Performance 
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11.5 Effect of external memory 

Extra processor cycles may be needed when program and/or data are held in external memory, depending 
both on the operation being performed, and on the speed of the external memory. After a processor cycle 
which initiates a write to memory, the processor continues execution at full speed until at least the next 
memory access. 

Whilst a reasonable estimate may be made of the effect of external memory, the actual performance will 
depend upon the exact nature of the given sequence of operations. 

External memory is characterized by the number of extra processor cycles per external memory cycle, denoted 
as e. For the IMS T414, with the fastest external memory the value of e is 2; a typical value for a large external 
memory is 5. 

If program is stored in external memory, and e has the value 2 or 3, then no extra cycles need be estimated for 
linear code sequences. For larger values of e, the number of extra cycles required for linear code sequences 
may be estimated at (e-3)/4. A transfer of control may be estimated as requiring e+3 cycles. 

These estimates may be refined for various constructs. In table 11 .6 n denotes the number of components in 
a construct. In the case of IF, the n'th conditional is the first to evaluate to TRUE, and the costs include the 
costs of the conditionals tested. The number of bytes in an array assignment or communication is denoted 
byb. 

Table 1 1 .6 External memory performance 





IMS T414 1 


Program off chip 


Data off chip 


Boolean expressions 


e-2 





IF 


3en-8 


en 


Replicated IF 


(6e-4)n+7 


(5e-2)n+8 


Replicated SEQ 


(3e-3)n+2 


(4e-2)n 


PAR 


(3e-1)n+8 


3en+4 


Replicated par 


(10e-8)n+8 


16en-12 


ALT 


(2e-4)n+6e 


(2e-2)n+10e-8 


Array assignment and 





max (2e, e(b/2)) 


communication in 






one transputer 







For the IMS T414 the effective rate of INMOS links is slowed down on output from external memory by e 
cycles per word output, and on input to external memory at 10 Mbits/sec by e-6 cycles per word if e>6. 

The following simulation results illustrate the effect of storing program and/or data in external memory. The 
results are normalized to 1 for both program and data on chip. The first program (Sieve of Erastosthenes) 
is an extreme case as it is dominated by small, data access intensive loops; it contains no concurrency, 
communication, or even multiplication or division. The second program is the pipeline algorithm for Newton 
Raphson square root computation. 
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Table 11.7 IMS T414 external memory performance 





Program 


e=2 


e=3 


e=4 


e=5 


On chip 


Program off chip 


1 
2 


1.3 
1.1 


1.5 
1.2 


1.7 
1.2 


1.9 
1.3 




Data off chip 


1 
2 


1.5 
1.2 


1.8 
1.4 


2.1 
1.6 


2.3 

1.7 




Program and data off chip 


1 
2 


1.8 
1.3 


2.2 
1.6 


2.7 
1.8 


3.2 
2.0 





11.6 Interrupt latency 

If the process is a high priority one and no other high priority process is running, the latency is as described 
in table 1 1 .8. The timings given are in full processor cycles TPCLPCL; the number of Tm states is also given 
where relevant. Maximum latency assumes all memory accesses are internal ones. 



Table 1 1 .8 Interrupt latency 





Typical 


Maximum 


TPCLPCL 


Tm 


TPCLPCL 


Tm 


IMST414 


19 


38 


53 


116 
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12.1 84 pin grid array package 
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Figure 12.1 IMS T414 84 pin grid array package pinout 
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Figure 12.2 84 pin grid array package dimensions 



Table 12.1 84 pin grid array pacl<age dimensions 



DilVI 


Millimetres 


Inches 


Notes 


NOM 


TOL 


NOM 


TOL 


A 


26.924 


±0.254 


1.060 


±0.010 




B 


17.019 


±0.127 


0.670 


±0.005 




C 


2.456 


±0.278 


0.097 


±0.011 




D 


4.572 


±0.127 


0.180 


±0.005 




E 


3.302 


±0.127 


0.130 


±0.005 




F 


0.457 


±0.025 


0.018 


±0.002 


Pin diameter 


G 


1.143 


±0.127 


0.045 


±0.005 


Flange diameter 


K 


22.860 


±0.127 


0.900 


±0.005 




L 


2.540 


±0.127 


0.100 


±0.005 




M 


0.508 




0.020 




Ciiamfer 



Package weight is approximately 7.2 grams 

Table 12.2 84 pin grid array package junction to ambient thermal resistance 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


^JA 


At 400 linear ft/min transverse air flow 






35 


°c/w 
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12.2 84 pin PLCC J-bend package 
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Figure 12.3 IMS T414 84 pin PLCC J-bend package pinout 



Notes 



1 Since the manufacture of the IMS T414, the pin numbers for the 84 pin J-bend chip carrier have been re- 
arranged, however the pin functions remain the same. 
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Detail A-A 



Figure 12.4 84 pin PLCC J-bend package dimensions 



Table 12.3 84 pin PLCC J-bend package dimensions 



DIM 


Millimetres 


Inches 


Notes 


NOM 


TOL 


NOM 


TOL 


A 


30.226 


±0.127 


1.190 


±0.005 




B 


29.312 


±0.127 


1.154 


±0.005 




C 


3.810 


±0.127 


0.150 


±0.005 




D 


0.508 


±0.127 


0.020 


±0.005 




F 


1.270 


±0.127 


0.050 


±0.005 




G 


0.457 


±0.127 


0.018 


±0.005 




J 


0.000 


±0.051 


0.000 


±0.002 




K 


0.457 


±0.127 


0.018 


±0.005 




L 


0.762 


±0.127 


0.030 


±0.005 





Package weight is approximately 7.0 grams 

Table 12.4 84 pin PLCC J-bend package junction to ambient thermal resistance 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


^JA 


At 400 linear ft/min transverse air flow 
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°c/w 
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13 Ordering 

This section indicates the designation of speed and package selections for the various devices. Speed of 
Clockln is 5 MHz for all parts. Transputer processor cycle time is nominal; it can be calculated more exactly 
using the phase lock loop factor PLLx, as detailed in the external memory section. 

For availability contact local INMOS sales office or authorised distributor. 



Table 13.1 IMS T414 ordering details 



INMOS 
designation 


Processor 
clock speed 


Processor 
cycle time 


PLLx 


Package 


IIVIS T414-G15S 
IIVIS T414-G20S 

IIVIS T414-J15S 
IMS T414-J20S 


15 MHz 
20 MHz 

15 MHz 
20 MHz 


67 ns 
50 ns 

67 ns 
50 ns 


3.0 
4.0 

3.0 
4.0 


Ceramic Pin Grid 
Ceramic Pin Grid 

Plastic PLCC J-Bend 
Plastic PLCC J-Bend 
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FEATURES 

1 6 bit architecture 

50 ns internal cycle time 

20 MIPS (peak) instruction rate 

IMS T222-20 is pin compatible with IMS T225-20 

4 Kbytes on-chip static RAM 

40 Mbytes/sec sustained data rate to internal memory 

64 Kbytes directly addressable external memory 

20 Mbytes/sec sustained data rate to external memory 

950 ns response to interrupts 

Four INMOS serial links 5/10/20 Mbits/sec 

Bi-directional data rate of 2.4 Mbytes/sec per link 

Internal timers of 1 /zs and 64 fis 

Boot from ROM or communication links 

Single 5 MHz clock input 

Single +5V ±5% power supply 

MIL-STD-883C processing is available 

APPLICATIONS 

Real time processing 

Microprocessor applications 

High speed multi processor systems 

Industrial control 

Robotics 

System simulation 

Digital signal processing 

Telecommunications 

Fault tolerant systems 

Medical instrumentation 
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1 Introduction 

The IMS T222 transputer is a 16 bit CMOS microcomputer with 4 Kbytes on-chip RAM for high speed 
processing, an external memory interface and four standard INMOS communication links. The instruction set 
achieves efficient implementation of high level languages and provides direct support for the Occam model 
of concurrency when using either a single transputer or a network. Procedure calls, process switching and 
typical interrupt latency are sub-microsecond. A device running at 20 MHz achieves an instruction throughput 
of 20 MIPS peak. The extended temperature version of the device complies with MIL-STD-883C. 

For convenience of description, the IMS T222 operation is split into the basic blocks shown in figure 1.1. 



vcc 

GND- 

CapPlus - 

CapMinus - 

Reset - 

Analyse - 



Error -*- 

BootFromROM — 

Clockln — 



DisablelntRam 



ProcClockOut ^ 

notMemCE ■ 

notMemWrBO-1 - 



MemWait - 

Mem B Ace - 

MemReq - 

MemGranted - 



System 
services 



Timers 



4k bytes 

of 

On-chip 

RAM 



External 
Memory 
Interface 



16 



_N 



16 bit 
Processor 



Link 
Services 



16 



JS 



Link 



V Interface 



16 



Link 
v\ Interface 



16 



Link 
U Interface 



16 



Link 



V Interface 



Event 



LinkSpeciai 

LinkOSpeciai 

Link123Special 

LinkInO 
LinkOutO 

Linkim 
LinkOutI 

Linkln2 
LmkOut2 

Linkln3 
LinkOutS 

EventReq 
- EventAck 



16 



16 






MemDO-15 
MemAO-15 



Figure 1 .1 IMS T222 block diagram 

The IMS T222 can directly access a linear address space of 64 Kbytes. The 16 bit wide non-multiplexed 
external memory interface provides a data rate of up to 2 bytes every 100 nanoseconds (20 Mbytes/sec) for 
a 20 MHz device. 



System Services include processor reset and bootstrap control, together with facilities for error analysis. 
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The INMOS communication links allow networks of transputers to be constructed by direct point to point 
connections with no external logic. The links support the standard operating speed of 10 Mbits/sec, but also 
operate at 5 or 20 Mbits/sec. The links support overlapped acknowledge; each IMS T222 link can transfer 
data bi-directionally at up to 2.05 Mbytes/sec. 

The IMS T222 is designed to implement the OCCam language, detailed in the OCCam Reference Manual, 
but also efficiently supports other languages such as C and Pascal. Access to the transputer at machine 
level is seldom required, but if necessary refer to the Transputer Instruction Set - A Compiler Writers' Guide. 

This data sheet supplies hardware implementation and characterisation details for the IMS T222. It is intended 
to be read in conjunction with the Transputer Architecture chapter, which details the architecture of the 
transputer and gives an overview of occann. 
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Pin designations 



Table 2.1 IMS T222 system services 



Pin 


In/Out 


Function 


VCC, GND 

CapPlus, CapMinus 

Clockln 

Reset 

Error 

Analyse 

BootFromRom 

DisablelntRAM 

HoldToGND 


in 
in 

out 
in 
in 
in 


Power supply and return 

External capacitor for internal clock power supply 

Input clock 

System reset 

Error indicator 

Error analysis 

Bootstraps from external ROM or from link 

Disable internal RAM 

Must be connected to GND 



Table 2.2 IMS T222 external memory interface 



Pin 


in/Out 


Function 


ProcClockOut 


out 


Processor clock 


MemAO-15 


out 


Sixteen address lines 


MemDO-15 


in/out 


Sixteen data lines 


notMemWrBO-1 


out 


Two byte-addressing write strobes 


notMemCE 


out 


Chip enable 


Mem B Ace 


in 


Byte access mode selector 


MemWait 


in 


Memory cycle extender 


MemReq 


in 


Direct memory access request 


MemG ranted 


out 


Direct memory access granted 



Table 2.3 IMS T222 event 



Pin 


In/Out 


Function 


EventReq 
EventAck 


in 

out 


Event request 

Event request acknowledge 



Table 2.4 IMS T222 link 



Pin 


In/Out 


Function 


LinklnO-3 


in 


Four serial data input channels 


LinkOutO-3 


out 


Four serial data output channels 


LinkSpeclal 


in 


Select non-standard speed as 5 or 20 Mbits/sec 


LinkOSpecial 


in 


Select special speed for Link 


Link123Special 


in 


Select special speed for Links 1 ,2,3 



Signal names are prefixed by not if they are active low, otherwise they are active high. 
Pinout details for various packages are given on page 448. 



403 



3 Processor 

The 16 bit processor contains instruction processing logic, instruction and work pointers, and an operand 
register. It directly accesses the high speed 4 Kbyte on-chip memory, which can store data or program. 
Where larger amounts of memory or programs in ROM are required, the processor has access to 64 Kbytes 
of memory via the External Memory Interface (EMI). 



3.1 



Registers 



The design of the transputer processor exploits the availability of fast on-chip memory by having only a small 
number of registers; six registers are used in the execution of a sequential process. The small number of 
registers, together with the simplicity of the instruction set, enables the processor to have relatively simple 
(and fast) data-paths and control logic. The six registers are: 

The workspace pointer which points to an area of store where local variables are kept. 

The instruction pointer which points to the next instruction to be executed. 

The operand register which is used in the formation of instruction operands. 

The A, B and C registers which form an evaluation stack. 

/A, B and C are sources and destinations for most arithmetic and logical operations. Loading a value into the 
stack pushes B into C, and A into B, before loading A. Storing a value from A, pops B into A and C into B. 

Expressions are evaluated on the evaluation stack, and instructions refer to the stack implicitly. For example, 
the add instruction adds the top two values in the stack and places the result on the top of the stack. The use of 
a stack removes the need for instructions to respecify the location of their operands. Statistics gathered from a 
large number of programs show that three registers provide an effective balance between code compactness 
and implementation complexity. 

No hardware mechanism is provided to detect that more than three values have been loaded onto the stack. 
It is easy for the compiler to ensure that this never happens. 

Any location in memory can be accessed relative to the workpointer register, enabling the workspace to be 
of any size. 

Further register details are given in Transputer Instruction Set - A Compiler Writers' Guide. 



Registers 




Locals 




Program 


A 








B 






C 






Workspace 








Next Inst 












Operand 





Figure 3.1 Registers 
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3.2 



Instructions 



The instruction set has been designed for simple and efficient compilation of high-level languages. All in- 
structions have the same format, designed to give a compact representation of the operations occurring most 
frequently in programs. 

Each instruction consists of a single byte divided into two 4-bit parts. The four most significant bits of the byte 
are a function code and the four least significant bits are a data value. 





Function Data 


7 4 3 


Operand Register 





Figure 3.2 Instruction format 



3.2.1 



DltecX functions 



The representation provides for sixteen functions, each with a data value ranging from to 15. Ten of these, 
shown in table 3.1, are used to encode the most important functions. 

Table 3.1 Direct functions 



load constant 


add constant 




load local 


store local 


load local pointer 


load non-local 


store non-local 




jump 


conditional jump 


call 



The most common operations in a program are the loading of small literal values and the loading and storing 
of one of a small number of variables. The load constant instruction enables values between and 15 to be 
loaded with a single byte instruction. The load local and store local instructions access locations in memory 
relative to the workspace pointer. The first 16 locations can be accessed using a single byte instruction. 

The load non-local and store non-local instructions behave similarly, except that they access locations in 
memory relative to the A register. Compact sequences of these instructions allow efficient access to data 
structures, and provide for simple implementations of the static links or displays used in the implementation 
of high level programming languages such as Occam, or Pascal. 



3.2.2 



Prefix functions 



Two more function codes allow the operand of any instruction to be extended in length; prefix and negative 
prefix. 

All instructions are executed by loading the four data bits into the least significant four bits of the operand 
register, which is then used as the instruction's operand. All instructions except the prefix instructions end by 
clearing the operand register, ready for the next Instruction. 

The pref/x instruction loads its four data bits into the operand register and then shifts the operand register up 
four places. The negative prefix instruction is similar, except that it complements the operand register before 
shifting it up. Consequently operands can be extended to any length up to the length of the operand register 
by a sequence of prefix instructions. In particular, operands in the range -256 to 255 can be represented 
using one prefix instruction. 



3 Processor 
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The use of prefix instructions has certain beneficial consequences. Firstly, they are decoded and executed 
in the same way as every other instruction, which simplifies and speeds instruction decoding. Secondly, they 
simplify language compilation by providing a completely uniform way of allowing any instruction to take an 
operand of any size. Thirdly, they allow operands to be represented in a form independent of the processor 
wordlength. 

3.2.3 Indirect functions 

The remaining function code, operate, causes its operand to be interpreted as an operation on the values 
held in the evaluation stack. This allows up to 16 such operations to be encoded in a single byte instruction. 
However, the prefix instructions can be used to extend the operand of an operate instruction just like any 
other. The instruction representation therefore provides for an indefinite number of operations. 

Encoding of the Indirect functions is chosen so that the most frequently occurring operations are represented 
without the use of a prefix instruction. These include arithmetic, logical and comparison operations such as 
add, exclusive or and greater than. Less frequently occurring operations have encodings which require a 
single prefix operation. 

3.2.4 Expression evaluation 

Evaluation of expressions sometimes requires use of temporary variables in the workspace, but the number 
of these can be minimised by careful choice of the evaluation order. 

Table 3.2 Expression evaluation 



Program 


Mnemonic 


X :=0 


Idc 

StI X 


X := #24 


pfix 2 
Idc 4 

StI X 


X := y + z 


Idl y 
Idl z 
add 

StI X 



3.2.5 Efficiency of encoding 

Measurements show that about 70% of executed instructions are encoded in a single byte; that is, without 
the use of prefix instructions. Many of these instructions, such as load constant and add require just one 
processor cycle. 

The instruction representation gives a more compact representation of high level language programs than 
more conventional instruction sets. Since a program requires less store to represent it, less of the memory 
bandwidth is taken up with fetching instructions. Furthermore, as memory is word accessed the processor 
will receive two instructions for every fetch. 

Short instructions also improve the effectiveness of instruction pre-fetch, which in turn improves processor 
performance. There is an extra word of pre-fetch buffer, so the processor rarely has to wait for an instruction 
fetch before proceeding. Since the buffer is short, there is little time penalty when a jump instruction causes 
the buffer contents to be discarded. 
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3.3 



Processes and concurrency 



A process starts, performs a number of actions, and then either stops without completing or terminates 
complete. Typically, a process is a sequence of instructions. A transputer can run several processes in 
parallel (concurrently). Processes may be assigned either high or low priority, and there may be any number 
of each (page 407). 

The processor has a microcoded scheduler which enables any number of concurrent processes to be exe- 
cuted together, sharing the processor time. This removes the need for a software kernel. 



At any time, a concurrent process may be 

Active 

Inactive - 



Being executed. 

On a list waiting to be executed. 

Ready to input. 

- Ready to output. 

- Waiting until a specified time. 

The scheduler operates in such a way that inactive processes do not consume any processor time. It allocates 
a portion of the processor's time to each process in turn. Active processes waiting to be executed are held 
in two linked lists of process workspaces, one of high priority processes and one of low priority processes 
(page 407). Each list is implemented using two registers, one of which points to the first process in the list, 
the other to the last. In the Linked Process List figure 3.3, process S is executing and P, Q and R are active, 
awaiting execution. Only the low priority process queue registers are shown; the high priority process ones 
perform in a similar manner. 





Registers 






Locals 






Program 






FPtn (Front) 




P 


^ 












BPtn (Back) 














Q 


j^ 
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-^ 
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S 










Workspace 






Next Inst 












Operand 

























Figure 3.3 Linked process list 



Table 3.3 Priority queue control registers 



Function 


High Priority 


Low Priority 


Pointer to front of active process list 
Pointer to back of active process list 


FptrO 
BptrO 


Fptn 
Bptn 



Each process runs until it has completed its action, but is descheduled whilst waiting for communication from 
another process or transputer, or for a time delay to complete. In order for several processes to operate in 
parallel, a low priority process is only permitted to run for a maximum of two time slices before it is forcibly 
descheduled at the next descheduling point (page 410). The time slice period is 5120 cycles of the external 
5 MHz clock, giving ticks approximately 1 ms apart. 



Processor 407 



A process can only be descheduled on certain instructions, known as descheduling points (page 410). As a 
result, an expression evaluation can be guaranteed to execute without the process being timesliced part way 
through. 

Whenever a process Is unable to proceed, its instruction pointer is saved in the process workspace and 
the next process taken from the list. Process scheduling pointers are updated by instructions which cause 
scheduling operations, and should not be altered directly. Actual process switch times are less than 1 /iS, as 
little state needs to be saved and it is not necessary to save the evaluation stack on rescheduling. 

The processor provides a number of special operations to support the process model, including start process 
and end process. When a main process executes a parallel construct, start process instructions are used 
to create the necessary additional concurrent processes. A start process instruction creates a new process 
by adding a new workspace to the end of the scheduling list, enabling the new concurrent process to be 
executed together with the ones already being executed. When a process is made active it is always added 
to the end of the list, and thus cannot pre-empt processes already on the same list. 

The correct termination of a parallel construct is assured by use of the end process instruction. This uses 
a workspace location as a counter of the parallel construct components which have still to terminate. The 
counter is initialised to the number of components before the processes are started. Each component ends 
with an end process instruction which decrements and tests the counter. For all but the last component, the 
counter is non zero and the component is descheduled. For the last component, the counter is zero and the 
main process continues. 

3.4 Priority 

The IMS T222 supports two levels of priority. Priority 1 (low priority) processes are executed whenever there 
are no active priority (high priority) processes. 

High priority processes are expected to execute for a short time. If one or more high priority processes are 
able to proceed, then one is selected and runs until it has to wait for a communication, a timer input, or until 
it completes processing. 

If no process at high priority is able to proceed, but one or more processes at low priority are able to proceed, 
then one is selected. 

Low priority processes are periodically timesliced to provide an even distribution of processor time between 
computationally intensive tasks. 

If there are n low priority processes, then the maximum latency from the time at which a low priority process 
becomes active to the time when it starts processing is 2n-2 timeslice periods. It is then able to execute for 
between one and two timeslice periods, less any time taken by high priority processes. This assumes that 
no process monopolises the transputer's time; i.e. it has a distribution of descheduling points (page 410). 

Each timeslice period lasts for 5120 cycles of the external 5 MHz input clock (approximately 1 ms at the 
standard frequency of 5 MHz). 

If a high priority process is waiting for an external channel to become ready, and if no other high priority 
process is active, then .the interrupt latency (from when the channel becomes ready to when the process 
starts executing) is typically 19 processor cycles, a maximum of 53 cycles (assuming use of on-chip RAM). 

3.5 Communications 

Communication between processes is achieved by means of channels. Process communication is point-to- 
point, synchronised and unbuffered. As a result, a channel needs no process queue, no message queue and 
no message buffer. 

A channel between two processes executing on the same transputer is implemented by a single word in 
memory; a channel between processes executing on different transputers is implemented by point-to-point 
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links. The processor provides a number of operations to support message passing, the most important being 
input message and output message. 

The input message and output message instructions use the address of the channel to determine whether 
the channel is internal or external. Thus the same instruction sequence can be used for both, allowing a 
process to be written and compiled without knowledge of where its channels are connected. 

The process which first becomes ready must wait until the second one is also ready. A process performs an 
input or output by loading the evaluation stack with a pointer to a message, the address of a channel, and 
a count of the number of bytes to be transferred, and then executing an input message or output message 
instruction. Data is transferred if the other process is ready. If the channel is not ready or is an external one 
the process will deschedule. 



3.6 



Timers 



The transputer has two 16 bit timer clocks which 'tick' periodically. The timers provide accurate process 
timing, allowing processes to deschedule themselves until a specific time. 

One timer is accessible only to high priority processes and is incremented every microsecond, cycling com- 
pletely in approximately 65 milliseconds. The other is accessible only to low priority processes and is incre- 
mented every 64 microseconds, giving exactly 15625 ticks in one second. It has a full period of approximately 
four seconds. 



Table 3.4 Timer registers 



C/oc/cO 
Ciockl 
TNextRegO 
TNextRegl 



Current value of high priority (level 0) process clock 

Current value of low priority (level 1) process clock 

Indicates time of earliest event on high priority (level 0) timer queue 

Indicates time of earliest event on low priority (level 1) timer queue 



The current value of the processor clock can be read by executing a load timer instruction. A process can 
arrange to perform a timer input, in which case it will become ready to execute after a specified time has 
been reached. The //mer /npuf instruction requires a time to be specified. If this time is in the 'past' then the 
instruction has no effect. If the time is in the 'future' then the process is descheduled. When the specified 
time is reached the process is scheduled again. 

Figure 3.4 shows two processes waiting on the timer queue, one waiting for time 21 , the other for time 31 . 
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Figure 3.4 Timer registers 
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4 Instruction set summary 

The Function Codes table 4.7. gives the basic function code set (page 404). Where the operand is less 
than 16, a single byte encodes the complete instruction. If the operand is greater than 15, one prefix 
instruction {pfix) is required for each additional four bits of the operand. If the operand is negative the first 
prefix instruction will be nfix. 

Table 4.1 pre//x coding 







Function 


Memory 


Mnemonic 


code 


code 


Idc 


#3 


#4 


#43 


Idc 


#35 






is coded as 








pfix 


#3 


#2 


#23 


Idc 


#5 


#4 


#45 


Idc 


#987 






is coded as 








pfix 


#9 


#2 


#29 


pfix 


#8 


#2 


#28 


Idc 


#7 


#4 


#47 


Idc 


-31 (Idc 


#FFE1) 




is coded as 








nfix 


#1 


#6 


#61 


Idc 


#1 


#4 


#41 



Tables 4.8 to 4.17 give details of the operation codes. Where an operation code is less than 16 (e.g. add: 
operation code 05), the operation can be stored as a single byte comprising the operate function code F and 
the operand (5 in the example). Where an operation code Is greater than 15 (e.g. ladd: operation code 16), 
the prefix function code 2 is used to extend the instruction. 



Table 4.2 operate coding 



Mnemonic 



Function 
code 



Memory 
code 



add 
is coded as 

opr 

ladd 
is coded as 

pfix 
opr 



(op. code #5) 

add 

(op. code #1 6) 

#1 
#6 



#F 



#2 
#F 



#F5 

#F5 

#21 F6 

#21 
#F6 



The Processor Cycles column refers to the number of periods TPCLPCL taken by an instruction executing 
In internal memory. The number of cycles is given for the basic operation only; where the memory code 
for an instruction is two bytes, the time for the prefix function (one cycle) should be added. For a 20 MHz 
transputer one cycle is 50 ns. Some instruction times vary. Where a letter is included in the cycles column it 
is interpreted from table 4.3. 
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Table 4.3 Instruction set interpretation 



ident 



Interpretation 



b 
n 
w 



Bit number of tine highest bit set in register A. Bit is the least significant bit. 

Number of places shifted. 

Number of words in the message. Part words are counted as full words. If the message 
is not word aligned the number of words is increased to include the part words at either 
end of the message. 



The DE column of the tables indicates the descheduling/error features of an instruction as described in 
table 4.4. 

Table 4.4 Instruction features 



Ident 


Feature 


See page: 


D 

E 


The instruction is a descheduling point 
The instruction will affect the Error flag 


410 
410.419 



4.1 



Descheduling points 



The instructions in table 4.5 are the only ones at which a process may be descheduled (page 406). They are 
also the ones at which the processor will halt if the Analyse pin is asserted (page 418). 



Table 4.5 Descheduling point instructions 



Inpui message 
timer alt wait 
jump 



output message 
timer input 
loop end 



output byte 
stop on error 
end process 



output word 
alt wait 
stop process 



4.2 Error instructions 

The instructions in table 4.6 are the only ones which can affect the Error flag (page 419) directly. 



Table 4.6 Error setting instructions 



add 


add constant 


subtract 




multiply 




divide 


remainder 


long add 


long subtract 


long divide 




set error 


testerr 






check word 


check subscript from 


check single 


check count from 1 



4 Instruction set summary 
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Table 4.7 IMS T222 function codes 



Function 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 





OX 


J 


3 


jump 


D 


1 


1X 


Idip 


1 


load local pointer 




2 


2X 


pfix 


1 


prefix 




3 


3X 


Idnl 


2 


load non-local 




4 


4X 


Idc 


1 


load constant 




5 


5X 


Idnip 


1 


load non-local pointer 




6 


6X 


nfix 


1 


negative prefix 




7 


7X 


Idl 


2 


load local 




8 


8X 


adc 


1 


add constant 


E 


9 


9X 


call 


7 


call 




A 


AX 


cj 


2 

4 


conditional jump (not taken) 
conditional jump (taken) 




B 


BX 


ajw 


1 


adjust workspace 




C 


OX 


eqc 


2 


equals constant 




D 


DX 


sti 


1 


store local 




E 


EX 


stnl 


2 


store non-local 




F 


FX 


opr 


- 


operate 





Table 4.8 IMS T222 arithmetic/logical operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


46 


24F6 


and 


1 


and 




4B 


24FB 


or 


1 


or 




33 


23F3 


xor 


1 


exclusive or 




32 


23F2 


not 


1 


bitwise not 




41 


24F1 


shI 


n+2 


shift left 




40 


24F0 


shr 


n+2 


shift right 




05 


F5 


add 


1 


add 


E 


00 


FG 


sub 


1 


subtract 


E 


53 


25F3 


mul 


23 


multiply 


E 


2C 


22FC 


div 


24 


divide 


E 


IF 


21 FF 


rem 


21 


remainder 


E 


09 


F9 


gt 


2 


greater than 




04 


F4 


diff 


1 


difference 




52 


25F2 


sum 


1 


sum 




08 


F8 


prod 


b+4 


product 
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Table 4.9 IMS T222 long arithmetic operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


16 


21 F6 


ladd 


2 


long add 


E 


38 


23F8 


Isub 


2 


long subtract 


E 


37 


23F7 


Isum 


3 


long sum 




4F 


24FF 


Idiff 


3 


long diff 




31 


23F1 


Imul 


17 


long multiply 




1A 


21 FA 


Idiv 


19 


long divide 


E 


36 


23F6 


IshI 


n+3 
n-12 


long shift left (n<1 6) 
long shift left(n> 16) 




35 


23F5 


Ishr 


n+3 
n-12 


long shift right (n<16) 
long shift right (n>16) 




19 


21 F9 


norm 


n+5 

n-10 
3 


normalise (n<16) 
normalise (n>16) 
normalise (n=32) 





Table 4.10 IMS T222 general operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


00 

3A 
56 
ID 
40 
42 


FO 

23FA 
25F6 
21 FD 
24FC 
24F2 


rev 

xword 
cword 
xdble 
csngI 
mint 


1 

4 
5 
2 
3 

1 


reverse 

extend to word 
check word 
extend to double 
check single 
minimum integer 


E 
E 



Table 4.11 IMS T222 indexing/array operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


02 


F2 


bsub 


1 


byte subscript 




OA 


FA 


wsub 


2 


word subscript 




34 


23F4 


bent 


2 


byte count 




3F 


23FF 


went 


4 


word count 




01 


F1 


lb 


5 


load byte 




3B 


23FB 


sb 


4 


store byte 




4A 


24FA 


move 


2w+8 


move message 





4 instruction set summary 
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Table 4.12 IMS T222 timer handling operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycies 


Name 


D 

E 


22 


22F2 


Idtimer 


2 


load timer 




2B 
4E 


22FB 
24FE 


tin 
talt 


30 
4 
4 


timer input (time future) 
timer input (time past) 
timer alt start 


D 
D 


51 
47 


25F1 
24F7 


taltwt 
enbt 


15 
48 
8 


timer alt wait (time past) 
timer alt wait (time future) 
enable timer 


D 
D 


2E 


22FE 


dist 


23 


disable timer 





Table 4.13 IMS T222 input/output operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


07 
OB 
OF 
OE 


F7 
FB 
FF 
FE 


in 

out 

outword 
outbyte 


2W+19 
2W+19 

23 

23 


input message 
output message 
output word 
output byte 


D 
D 
D 
D 


43 
44 

45 


24F3 
24F4 

24F5 


alt 
aitwt 

altend 


2 
5 
17 
4 


alt start 

alt wait (channel ready) 
alt wait (channel not ready) 
alt end 


D 
D 


49 
30 


24F9 
23F0 


enbs 
diss 


3 

4 


enable skip 
disable skip 




12 
48 

2F 


21 F2 
24F8 

22FF 


resetch 
enbc 

disc 


3 
7 
5 
8 


reset channel 
enable channel (ready) 
enable channel (not ready) 
disable channel 





Table 4.14 IMS T222 control operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


20 
IB 
30 
06 

21 


22F0 
21 FB 
23FC 

F6 
22F1 


ret 

Idpi 

gajw 

gcall 

lend 


5 
2 
2 

4 
10 
5 


return 

load pointer to instruction 

general adjust workspace 

general call 

loop end (loop) 

loop end (exit) 


D 
D 
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Table 4.15 IMS T222 scheduling operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


OD 


FD 


startp 


12 


start process 


D 


03 


F3 


endp 


13 


end process 


D 


39 


23F9 


runp 


10 


run process 




15 


21 F5 


stopp 


11 


stop process 




IE 


21 FE 


IdprI 


1 


load current priority 





Table 4.16 IMS T222 error handling operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


13 
4D 


21 F3 
24FD 


csubO 
ccnti 


2 
3 


check subscript from 
check count from 1 


E 
E 


29 
10 


22F9 
21 FO 


testerr 
seterr 


2 
3 

1 


test error false and clear (no error) 
test error false and clear (error) 
set error 


E 


55 
57 


25F5 
25F7 


stoperr 
cirhalterr 


2 

1 


stop on error (no error) 
clear halt-on-error 


D 


58 


25F8 


sethalterr 


1 


set halt-on-error 




59 


25F9 


testhalterr 


2 


test halt-on-error 





Table 4.17 IMS T222 processor initialisation operation codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 


2A 


22FA 


testpranal 


2 


test processor analysing 




3E 


23FE 


saveh 


4 


save high priority queue registers 




3D 


23FD 


savel 


4 


save low priority queue registers 




18 


21 F8 


sthf 




store high priority front pointer 




50 


25F0 


sthb 




store high priority back pointer 




10 


21 FC 


stif 




store low priority front pointer 




17 


21 F7 


stib 




store low priority back pointer 




54 


25F4 


sttimer 




store timer 
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5 System services 

System services include all the necessary logic to initialise and sustain operation of the device. They also 
include error handling and analysis facilities. 

5.1 Power 

Power is supplied to the device via the VCC and GND pins. Several of each are provided to minimise 
inductance within the package. All supply pins must be connected. The supply must be decoupled close to 
the chip by at least one 100 nF low inductance (e.g. ceramic) capacitor between VCC and GND. Four layer 
boards are recommended; if two layer boards are used, extra care should be taken in decoupling. 

Input voltages must not exceed specification with respect to VCC and GND, even during power-up and power- 
down ramping, othenA/ise latchup can occur. CMOS devices can be permanently damaged by excessive 
periods of latchup. 

5.2 CapPlus, CapMinus 

The internally derived power supply for internal clocks requires an external low leakage, low inductance 1/xF 
capacitor to be connected between CapPlus and CapMinus. A ceramic capacitor is preferred, with an 
impedance less than 3 Ohms between 100 KHz and 20 MHz. If a polarised capacitor is used the negative 
terminal should be connected to CapMinus. Total RGB track length should be less than 50 mm. The 
connections must not touch power supplies or other noise sources. 









P.C.B. track 




VCC T 

T 


CapPlus 




1 






Decoupling 
=4= capacitor 
VF 




Phase-locked 
loops 






1 






GND -i 


CapMinus 


P.C.B. track 









Figure 5.1 Recommended PLL decoupling 



5.3 Clockln 



Transputer family components use a standard clock frequency, supplied by the user on the Clockln input. 
The nominal frequency of this clock for all transputer family components is 5 MHz, regardless of device type, 
transputer word length or processor cycle time. High frequency internal clocks are derived from Clockln, 
simplifying system design and avoiding problems of distributing high speed clocks externally. 

A number of transputer devices may be connected to a common clock, or may have individual clocks providing 
each one meets the specified stability criteria. In a multi-clock system the relative phasing of Clockln clocks 
is not important, due to the asynchronous nature of the links. Mark/space ratio is unimportant provided the 
specified limits of Clockln pulse widths are met. 

Oscillator stability is important. Clockln must be derived from a crystal oscillator; RC oscillators are not 
sufficiently stable. Clockln must not be distributed through a long chain of buffers. Clock edges must be 
monotonic and remain within the specified voltage and time limits. 
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Table 5.1 Input clock 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TDCLDCH 


Clock! n pulse width low 


40 






ns 


1 


TDCHDCL 


Clockln pulse width high 


40 






ns 


1 


TDCLDCL 


Clockln period 




200 




ns 


1.2.4 


TDCerror 


Clockln timing error 






±0.5 


ns 


1.3 


TDC1DC2 


Difference in Clockln for 2 linked devices 






400 


ppm 


1,4 


TDCr 


Clockln rise time 






10 


ns 


1,5 


TDCf 


Clockln fall time 






8 


ns 


1,5 



Notes 



1 These paramters are not tested. 

2 Measured between corresponding points on consecutive falling edges. 

3 Variation of individual falling edges from their nominal times. 

4 This value allows the use of 200ppm crystal oscillators for two devices connected together by a link. 

5 Clock transitions must be monotonic within the range VIH to VIL (table 10.3). 



TDCerror 



1.5v 



2.0v- ~ -^ 
0.8vl II-" 



^ TDCerror 



TDCerror ^ 



TDCLDCH 




TDCHDCL 




TDCLDCL 

90% 

10% 



^ TDCerror 




fTDCr 



Figure 5.2 Clockln timing 



5.4 



Reset 



Reset can go high with VCC, but must at no time exceed the maximum specified voltage for VIH. After VCC 
is valid Clockln should be running for a minimum period TDCVRL before the end of Reset. The falling edge 
of Reset initialises the transputer and starts the bootstrap routine. Link outputs are forced low during reset; 
link inputs and EventReq should be held low. Memory request (DMA) must not occur whilst Reset is high but 
can occur before bootstrap (page 431). If BootFromRom is high bootstrapping will take place immediately 
after Reset goes low, using data from external memory; otherwise the transputer will await an input from any 
link. The processor will be in the low priority state. 



5.5 



Bootstrap 



The transputer can be bootstrapped either from a link or from external ROM. To facilitate debugging, Boot- 
FromRom may be dynamically changed but must obey the specified timing restrictions. It is sampled once 
only by the transputer, before the first instruction is executed after Reset is taken low. 

If BootFromRom is connected high (e.g. to VCC) the transputer starts to execute code from the top two 
bytes in external memory, at address #7FFE. This location should contain a backward jump to a program 
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In ROM. Following this access, BootFromRom may be taken low If required. The processor Is In the low 
priority state, and the IV register points to MemStart (page 420). 



Table 5.2 Reset and Analyse 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPVRH 


Power valid before Reset 


10 






ms 




TRHRL 


Reset pulse width high 


8 






Clockln 


1 


TDCVRL 


Clockln running before Reset end 


10 






ms 


2 


TAHRH 


Analyse setup before Reset 


3 






ms 




TRIAL 


Analyse hold after Reset end 


1 






Clockln 


1 


TBRVRL 


BootFromRom setup 









ms 




TRLBRX 


BootFromRom hold after Reset 









ms 


3 


TALBRX 


BootFromRom hold after Analyse 










3 



Notes 



1 Full periods of Clockln TDCLDCL required. 

2 At power-on reset. 

3 Must be stable until after end of bootstrap period. See Bootstrap section. 
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Figure 5.3 Transputer reset timing with Analyse low 
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Figure 5.4 Transputer reset and analyse timing 
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If BootFromRom js connected low (e.g. to GND) the transputer will wait for the first bootstrap message to 
arrive on any one of its links. The transputer is ready to receive the first byte on a link within two processor 
cycles TPCLPCL after Reset goes low. 

If the first byte received (the control byte) is greater than 1 it is taken as the quantity of bytes to be input. The 
following bytes, to that quantity, are then placed in internal memory starting at location MemStart. Following 
reception of the last byte the transputer will start executing code at MemStart as a low priority process. 
BootFromRom may be taken high after reception of the last byte, if required. The memory space immediately 
above the loaded code is used as work space. Messages arriving on other links after the control byte has 
been received and on the bootstrapping link after the last bootstrap byte will be retained until a process inputs 
from them. 

5.6 Peek and poke 

Any location in internal or external memory can be interrogated and altered when the transputer is waiting 
for a bootstrap from link. If the control byte is then four more bytes are expected on the same link. The 
first two byte word is taken as an internal or external memory address at which to poke (write) the second 
two byte word. If the control byte is 1 the next two bytes are used as the address from which to peek (read) 
a word of data; the word is sent down the output channel of the same link. 

Following such a peek or poke, the transputer returns to its previously held state. Any number of accesses 
may be made in this way until the control byte is greater than 1 , when the transputer will commence reading 
its bootstrap program. Any link can be used, but addresses and data must be transmitted via the same link 
as the control byte. 

5.7 Analyse 

If Analyse is taken high when the transputer is running, the transputer will halt at the next descheduling point 
(page 410). From Analyse being asserted, the processor will halt within three time slice periods plus the 
time taken for any high priority process to complete. As much of the transputer status is maintained as is 
necessary to permit analysis of the halted machine. Processor flags Error and HaltOnError are not altered 
at reset, whether Analyse is asserted or not. 

Input links will continue with outstanding transfers. Output links will not make another access to memory 
for data but will transmit only those bytes already in the link buffer. Providing there is no delay in link 
acknowledgement, the links should be inactive within a few microseconds of the transputer halting. 

Reset should not be asserted before the transputer has halted and link transfers have ceased. If BootFrom- 
Rom is high the transputer will bootstrap as soon as Analyse is taken low, otherwise it will await a control 
byte on any link. If Analyse is taken low without Reset going high the transputer state and operation are 
undefined. After the end of a valid Analyse sequence the registers have the values given in table 5.3. 

Table 5.3 Register values after Analyse 



/ MemStart if bootstrapping from a link, or the external memory bootstrap address if 
bootstrapping from ROM. 

W MemStart if bootstrapping from ROM, or the address of the first free word after the 
bootstrap program if bootstrapping from link. 

A The value of / when the processor halted. 

B The value of IV when the processor halted, together with the priority of the process 
when the transputer was halted (i.e. the IV descriptor). 

C The ID of the bootstrapping link if bootstrapping from link. 
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5.8 



Error 



The Error pin is connected directly to the internal Error flag and follows the state of that flag. If Error is 
high it indicates an error in one of the processes caused, for example, by arithmetic overflow, divide by zero, 
array bounds violation or software setting the flag directly (page 410). Once set, the Error flag is only cleared 
by executing the instruction testerr. The error is not cleared by processor reset, in order that analysis can 
identify any errant transputer (page 418). 

A process can be programmed to stop if the Error flag is set; It cannot then transmit erroneous data to other 
processes, but processes which do not require that data can still be scheduled. Eventually all processes 
which rely, directly or indirectly, on data from the process in error will stop through lack of data. 

By setting the HaltOn Error Uag the transputer itself can be programmed to halt if Error becomes set. If Error 
becomes set after HaltOnErrorhas been set, all processes on that transputer will cease but will not necessarily 
cause other transputers in a network to halt. Setting HaltOnError a^er Error will not cause the transputer to 
halt; this allows the processor reset and analyse facilities to function with the flags in indeterminate states. 

An alternative method of error handling Is to have the errant process or transputer cause all transputers to 
halt. This can be done by applying the Error output signal of the errant transputer to the EventReq pin of a 
suitably programmed master transputer. Since the process state is preserved when stopped by an error, the 
master transputer can then use the analyse function to debug the fault. When using such a circuit, note that 
the Error flag is in an indeterminate state on power up; the circuit and software should be designed with this 
in mind. 



Error checks can be removed completely to optimise the performance of a proven program; any unexpected 
error then occurring will have an arbitrary undefined effect. 

If a high priority process pre-empts a low priority one, status of the Error and HaltOnError flags is saved 
for the duration of the high priority process and restored at the conclusion of it. Status of the Error flag is 
transmitted to the high priority process but the HaltOnError flag is cleared before the process starts. Either 
flag can be altered in the process without upsetting the error status of any complex operation being carried 
out by the pre-empted low priority process. 

In the event of a transputer halting because of HaltOnError, the links will finish outstanding transfers before 
shutting down. If Analyse is asserted then all inputs continue but outputs will not make another access to 
memory for data. 

After halting due to the Error flag changing from to 1 whilst HaltOnError \s set, register /points two bytes 
past the instruction which set Error. After halting due to the Analyse pin being taken high, register / points 
one byte past the instruction being executed. In both cases /will be copied to register A 
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Figure 5.5 Error handling in a multi-transputer system 
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6 Memory 

The IMS T222 has 4 Kbytes of fast internal static memory for high rates of data throughput. Each internal 
memory access takes one processor cycle ProcClockOut (page 422). The transputer can also access an 
additional 60 Kbytes of external memory space. Internal and external memory are part of the same linear 
address space. Internal RAM can be disabled by holding DisablelntRAM high. All internal addresses are 
then mapped to external RAM. This pin should not be altered after Reset has been taken low. 

IMS T222 memory is byte addressed, with words aligned on two-byte boundaries. The least significant byte 
of a word is the lowest addressed byte. 

The bits in a byte are numbered to 7, with bit the least significant. The bytes are numbered from 0, with 
byte the least significant. In general, wherever a value is treated as a number of component values, the 
components are numbered in order of increasing numerical significance, with the least significant component 
numbered 0. Where values are stored in memory, the least significant component value is stored at the 
lowest (most negative) address. 

Internal memory starts at the most negative address #8000 and extends to #8FFF. User memory begins at 
#8024; this location is given the name MemStart. 

The reserved area of internal memory below MemStart is used to implement link and event channels. 

Two words of memory are reserved for timer use, TPtrLocO for high priority processes and TPtrLod for low 
priority processes. They either indicate the relevant priority timer is not in use or point to the first process on 
the timer queue at that priority level. 

Values of certain processor registers for the current low priority process are saved in the reserved IntSaveLoc 
locations when a high priority process pre-empts a low priority one. 

External memory space starts at #9000 and extends up through #0000 to #7FFF. ROM bootstrapping code 
must be in the most positive address space, starting at #7FFE. Address space immediately below this is 
conventionally used for ROM based code. 
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hi Machine map lo 


Byte address Word offsets 


Occam map 






Reset Inst 
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Figure 6.1 IMS T222 memory map 



1 These locations are used as auxiliary processor registers and should not be manipulated by the user. Like 
processor registers, their contents may be useful for implementing debugging tools (Analyse, page 418). For 
details see Transputer lr)struction Set - A Compiler Writers' Guide. 
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7 External memory interface 

The IMS T222 External Memory Interface (EMI) allows access to a 16 bit address space via separate address 
and data buses. The data bus can be configured for either 16 bit or 8 bit memory access, allowing the use of 
a single bank of byte-wide memory. Both word-wide and byte-wide access may be mixed In a single memory 
system (page 428). 



7.1 



ProcClockOut 



This clock is derived from the internal processor clock, which Is in turn derived from Clockln. Its period Is 
equal to one internal microcode cycle time, and can be derived from the formula 

TPCLPCL = TDCLDCL / PLLx 

where TPCLPCL is the ProcClockOut Period, TDCLDCL is the Clockln Period and PLLx is the phase 
lock loop factor for the relevant speed part, obtained from the ordering details (Ordering section). 

Edges of the various external memory strobes are synchronised by, but do not all coincide with, rising or 
falling edges of ProcClockOut. 



Table 7.1 ProcClockOut 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPCLPCL 
TPCHPCL 
TPCLPCH 
TPCstab 


ProcClockOut period 
ProcClockOut pulse width high 
ProcClockOut pulse width low 
ProcClockOut stability 


a-2 
b-7 


a 
b 




a+2 

b+7 

8 


ns 
ns 
ns 
% 


1,5 
2,5 
3.5 
4,5 



Notes 



1 a is TDCLDCL/PLLx. 

2 b is 0.5*TPCLPCL (half the processor clock period). 

3 c is TPCLPCL-TPCHPCL. 

4 Stability is the variation of cycle periods between two consecutive cycles, measured at corresponding points on 
the cycles. 

5 This parameter is sampled and not 100% tested. 
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Figure 7.1 IMS T222 ProcClockOut timing 
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7.2 Tstates 

The external memory cycle is divided Into four Tstates with the following functions: 

T1 Address and control setup time. 

T2 Data setup time. 

T3 Data read/write. 

T4 Data and address hold after access. 

Each Tstate is half a processor cycle TPCLPCL long. An external memory cycle is always a complete number 
of cycles TPCLPCL in length and the start of T1 always coincides with a rising edge of ProcClockOut. T2 
can be extended indefinitely by adding externally generated wait states of one complete processor cycle each. 

7.3 Internal access 

During an internal memory access cycle the external memory interface address bus MemAO-15 reflects the 
word address used to access internal RAM, notMemWrBO-1 and notMemCE are inactive and the data bus 
MemDO-15 is tristated. This is true unless and until a DMA (memory request) activity takes place, when the 
lines will be placed in a high impedance state by the transputer. 

Bus activity is not adequate to trace the internal operation of the transputer in full, but may be used for 
hardware debugging in conjuction with peek and poke (page 418). 

7.4 MemAO-15 

External memory addresses are output on a non-multiplexed 16 bit bus. The address is valid at the start of 
T1 and remains so until the end of T4, with the timing shown. Byte addressing Is carried out internally by the 
IMS T222 for read cycles. For write cycles the relevant bytes In memory are addressed by the write enables 
notMemWrBO-1 . 

The transputer places the address bus In a high Impedance state during DMA. 

7.5 MemDO-15 

The non-multiplexed data bus is 16 bits wide. Read cycle data may be set up on the bus at any time after 
the start of T1, but must be valid when the IMS T222 reads it during T4. Data can be removed any time after 
the rising edge of notMemCE, but must be off the bus no later than the middle of T1, which allows for bus 
turn-around time before the data lines are driven at the start of T2 in a processor write cycle. 

Write data is placed on the bus at the start of T2 and removed at the end of T4. It is normally written into 
memory in synchronism with notMemCE going high. 

The data bus is high impedance except when the transputer is writing data. If only one byte is being written, 
the unused 8 bits of the bus are high impedance at that time. In byte access mode MemD8-15 are high 
impedance during the external memory cycle which writes the most significant (second) byte (page 428). 

If the data setup time for read or write is too short it can be extended by inserting wait states at the end of 
T2 (page 429). 
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Table 7.2 Read 



SYMBOL 


PARAMETER 


T222-20 


T222-17 


UNITS 


NOTE 


MIN 


MAX 


MIN 


MAX 


TAVEL 


Address valid before chip enable low 


8 




12 




ns 


1 


TELEH 


Chip enable low 


68 


80 


83 


88 


ns 


1 


TEHEL 


Delay before chip enable re-assertion 


19 




24 




ns 


1.2 


TEHAX 


Address hold after chip enable high 


3 




12 




ns 


1 


TELDrV 


Data valid from chip enable low 





50 





53 


ns 




TAVDrV 


Data valid from address valid 





63 





65 


ns 




TDrVEH 


Data setup before chip enable high 


22 




30 




ns 




TEHDrZ 


Data hold after chip enable high 





20 





24 


ns 




TWEHEL 


Write enable setup before chip enable low 


18 




24 




ns 


3 


TPCHEL 


ProcClockOut high to chip enable low 


8 




12 




ns 


1 



Notes 



1 This parameter is common to read and write cycles and to byte-wide memory accesses. 

2 These values assume back-to-back external memory accesses. 

3 Timing is for both write enables notMemWrBO-1 . 
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Figure 7.2 IMS T222 external read cycle 



7.6 



notMemWrBO-1 



Two write enables are provided, one to write each byte of the word. When writing a word, both write enables 
are asserted; when writing a byte only the appropriate write enable is asserted. notMemWrBO addresses 
the least significant byte. The write enables are active before the chip enable signal notMemCE becomes 
active, thus reducing memory access time and the risk of bus contention. 

The write enables are synchronised with the chip enable signal notMemCE, allowing them to be used without 
notMemCE for simple designs. 
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Data may be strobed into memory using notMemWrBO-1 without the use of notMemCE, as the write enables 
go high between consecutive external memory write cycles. The write enables are placed in a high impedance 
state during DMA, and are inactive during internal memory access. 

Table 7.3 Write 



SYMBOL 


PARAMETER 


T222-20 


T222-17 


UNITS 


NOTE 


MIN 


MAX 


MIN 


MAX 


TDwVEH 
TEHDwZ 
TDwZEL 
TWELEL 
TEHWEH 


Data setup before chip enable high 
Data hold after write 
Write data invalid to next chip enable 
Write enable setup before chip enable low 
Write enable hold after chip enable high 


50 
5 
1 

-8 
-3 


25 

3 
6 


57 

12 
12 
-4 



17 


4 


ns 
ns 
ns 
ns 
ns 


1 
1 



Notes 



1 Timing is for both write enables notMemWrBO-1. 
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Figure 7.3 IMS T222 external write cycle 
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7.7 notMemCE 

The active low signal notMemCE is used to enable external memory on both read and write cycles. 
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Figure 7.5 IMS T222 static RAM application 
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Figure 7.6 IMS T222 Least significant byte write in word 
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Figure 7.7 IMS T222 Most significant byte write to byte-wide memory 
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7.8 



Mem B Ace 



The IMS T222 will, by default, perform word access at even memory locations. Access to byte-wide memory 
can be achieved by taking MemBAcc high with the timing shown. Where all external memory operations are 
to byte-wide memory, MemBAcc may be wired permanently high. The state of this signal is latched during 
T2. 

If MemBAcc is low then a full word will be accessed in one external memory cycle, otherwise the high and 
low bytes of the word will be separately accessed during two consecutive cycles. The first (least significant) 
byte is accessed at the word address (MemAO is low). The second (most significant) byte is accessed at the 
word address +1 (MemAO is high). 

With MemBAcc high, the first cycle is identical with a normal word access cycle. However, it will be im- 
mediately followed by another memory cycle, which will use MemDO-7 to read or write the second (most 
significant) byte of data. During this second cycle notMemWrBI remains high, both for read and write, and 
MemD8-15 are high impedance. When writing a single byte with MemBAcc high, both the first and second 
cycles are performed with notMemWrBO asserted in the appropriate cycle. 

Table 7.4 Byte-wide memory access 



SYMBOL 


PARAMETER 


T222-20 


T222-17 


UNITS 


NOTE 


MIN 


MAX 


MIN 


MAX 


TELBAH 
TELBAL 


MemBAcc high from chip enable 
MemBAcc low from chip enable 


32 


12 


29 


15 


ns 
ns 
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Figure 7.8 IMS T222 word write to byte-wide memory 
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7.9 



MemWait 



Taking MemWait high with the timing shown in the diagram will extend the duration of T2 by one processor 
cycle TPCLPCL. One wait state comprises the pair W1 and W2. MemWait is sampled during T2, and should 
not change state in this region, if MeijiWait is still high when sampled in W2 then another wait period will be 
inserted. This can continue indefinitely. Internal memory access is unaffected by the number of wait states 
selected. 

The wait state generator can be a simple digital delay line, synchronised to notMemCE. The Single Wait 
State Generator circuit in figure 7.10 can be extended to provide two or more wait states, as shown in 
figure 7.11. 

Table 7.5 Memory wait 



SYMBOL 


PARAMETER 


T222-20 


T222-17 


UNITS 


NOTE 


MIN 


MAX 


MIN 


MAX 


TPCHWtH 
TPCHWtL 


MemWait asserted after ProcClockOut high 
Wait low after ProcClockOut high 


45 


25 


39 


27 


ns 
ns 





Tstate 

ProcClockOut / 

MemAO-15 ) 


T1 1 T2 

r \ 


1 W1 1 W2 1 


T3 1 T4 1 T1 1 




Address 


X 


notMemCE 






T D O Ul \A/ ♦ 1 


/ \ 


^ ^ 




1 rOnWlL 


TPCHWtH 




MemWait / 


</yy 


N 


k\\ 


///// 
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? <. 
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Figure 7.9 IMS T222 memory wait timing 
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Figure 7.10 Single wait state generator 
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Figure 7.1 1 Extendable wait state generator 
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7.10 MemReq, MemGranted 

Direct memory access (DMA) can be requested at any time by taking the asynchronous MemReq input high. 
For external memory cycles, the IMS T222 samples MemReq during the first high phase of ProcClockOut 
after notMemCE goes low. In the absence of an external memory cycle, MemReq is sampled during every 
rising edge of ProcClockOut. MemAO-15, MemDO-15, notMemWrBO-1 and notMemCE are tristated before 
MemGranted is asserted. 



Removal of MemReq is sampled at each rising edge of ProcClockOut and MemGranted removed with 
the timing shown. Further external bus activity, either external cycles or reflection of internal cycles, will 
commence during the next low phase of ProcClockOut. 

Chip enable, write enables, address bus and data bus are in a high impedance state during DMA. External 
circuitry must ensure that notMemCE and notMemWrBO-1 do not become active whilst control is being 
transferred; it is recommended that a 10K resistor is connected from VCC to each pin. DMA cannot interrupt 
an external memory cycle. DMA does not interfere with internal memory cycles in any way, although a program 
running in internal memory would have to wait for the end of DMA before accessing external memory. DMA 
cannot access internal memory. 
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Figure 7.12 IMS T222 DMA sequence at reset 
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Figure 7.13 IMS T222 operation of MemReq and MemGranted with external and internal memory cycles 
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DMA allows a bootstrap program to be loaded into external RAM ready for execution after reset. If MemReq is 
held high throughout reset, MemGranted will be asserted before the bootstrap sequence begins. MemReq 
must be high at least one period TDCLDCL of Clockln before Reset. The circuit should be designed to 
ensure correct operation if Reset could interrupt a normal DMA cycle. 

Table 7.6 Memory request 



SYMBOL 


PARAMETER 


T222-20 


T222-17 


UNITS 


NOTE 


MIN 


MAX 


MIN 


MAX 


TMRHMGH 

TMRLMGL 

TAZMGH 

TAVMGL 

TDZMGH 

TEZMGH 

TEVMGL 

TWEZMGH 

TWEVMGL 


Memory request response time 
Memory request end response time 
Addr. bus tristate before MemGranted 
Addr. bus active after MemGranted end 
Data bus tristate before MemGranted 
Chip enable tristate before MemGranted 
Chip enable active after MemGranted end 
Write enable tristate before MemGranted 
Write enable active after MemGranted end 


75 
80 




-6 

-6 


a 

155 


100 
100 
















a 

114 


ns 
ns 
ns 
ns 
ns 
ns 
ns 
ns 
ns 


1 

2 
2 



Notes 



1 Maximum response time a depends on whether an external memory cycle is in progress and whether byte 
access is active. Maximum time is (2 processor cycles) + (number of wait state cycles) for word access; in byte 
access mode this time is doubled. 

2 When using DMA, notMemCE and notMemWrBO-1 should be pulled up with a resistor (typically 1.2k). Capac- 
itance should be limited to a maximum of SOpF. 
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Figure 7.14 IMS T222 memory request timing 
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Events 



EventReq and EventAck provide an asynchronous handshake interface between an external event and an 
internal process. When an external event takes EventReq high the external event channel (additional to the 
external link channels) is made ready to communicate with a process. When both the event channel and the 
process are ready the processor takes EventAck high and the process, if waiting, is scheduled. EventAck 
is removed after EventReq goes low. 

Only one process may use the event channel at any given time. If no process requires an event to occur 
EventAck will never be taken high. Although EventReq triggers the channel on a transition from low to high, 
it must not be removed before EventAck is high. EventReq should be low during Reset; if not it will be 
ignored until it has gone low and returned high. EventAck is taken low when Reset occurs. 

If the process is a high priority one and no other high priority process is running, the latency is as described 
on page 407. Setting a high priority task to wait for an event input allows the user to interrupt a transputer 
program running at low priority. The time taken from asserting EventReq to the execution of the microcode 
interrupt handler in the CPU is four cycles. The following functions take place during the four cycles: 

Cycle 1 Sample EventReq at pad on the rising edge of ProcClockOut and synchronise. 

Cycle 2 Edge detect the synchronised EventReq and form the interrupt request. 

Cycle 3 Sample interrupt vector for microcode ROM in the CPU. 

Cycle 4 Execute the interrupt routine for Event rather than the next instruction. 

Table 8.1 Event 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TVHKH 
TKHVL 
TVLKL 
TKLVH 


Event request response 

Event request hold 

Delay before removal of event acknowledge 

Delay before re-assertion of event request 










a+7ns 


ns 
ns 

ns 


1 



Notes 



1 a is 3 processor cycles TPCLPCL. 
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Figure 8.1 IMS T222 event timing 
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Four identical INMOS bi-directional serial links provide synchronized communication between processors 
and with the outside world. Each link comprises an input channel and output channel. A link between two 
transputers is implemented by connecting a link interface on one transputer to a link interface on the other 
transputer. Every byte of data sent on a link is acknowledged on the input of the same link, thus each signal 
line carries both data and control information. 

The quiescent state of a link output is low. Each data byte is transmitted as a high start bit followed by a one 
bit followed by eight data bits followed by a low stop bit. The least significant bit of data is transmitted first. 
After transmitting a data byte the sender waits for the acknowledge, which consists of a high start bit followed 
by a zero bit. The acknowledge signifies both that a process was able to receive the acknowledged data byte 
and that the receiving link is able to receive another byte. The sending link reschedules the sending process 
only after the acknowledge for the final byte of the message has been received. 

The IMS T222 links allow an acknowledge packet to be sent before the data packet has been fully received. 
This overlapped acknowledge technique is fully compatible with all other INMOS transputer links. The hard 
output channels are not double buffered. There is thus a pause between transmission of the last byte of 
a word of the message and the first byte of the next word. This pause time is related to memory speed. 
Hard input channels have one byte of double buffering and are unlikely to affect the data rate. The dominant 
factor affecting link bandwidth is therefore the memory bandwidth of the transmitting transputer, as shown in 
table 9.1. Internal memory access time is similar to zero wait state external access time. Times are for two 
interconnected IMS T222's with 20 Mbits/sec link speed. 

Table 9.1 Memory/Link speed relationship 



Memory Speed 
(20MHz device) 


Byte Output 
Time nS 


Word Memory 
Read nS 


Unidirectioal 
Data Rate Mbytes/sec 


1 cycle (0 wait) 

2 cycle (1 wait) 

3 cycle (2 wait) 


575 
575 
575 


200 
250 
300 


1.48 
1.42 
1.38 



The IMS T222 links support the standard INMOS communication speed of 10 Mbits/sec. In addition they can 
be used at 5 or 20 Mbits/sec. Links are not synchronised with Clockln or ProcClockOut and are insensitive 
to their phases. Thus links from independently clocked systems may communicate, providing only that the 
clocks are nominally identical and within specification. 

Links are TTL compatible and intended to be used in electrically quiet environments, between devices on a 
single printed circuit board or between two boards via a backplane. Direct connection may be made between 
devices separated by a distance of less than 300 millimetres. For longer distances a matched 100 ohm 
transmission line should be used with series matching resistors RM. When this is done the line delay should 
be less than 0.4 bit time to ensure that the reflection returns before the next data bit is sent. 



Buffers may be used for very long transmissions. If so, their overall propagation delay should be stable within 
the skew tolerance of the link, although the absolute value of the delay is immaterial. 

Link speeds can be set by LinkSpecial, LinkOSpecial and Link123Speciai. The link speed can be 
set independently. Table 9.2 shows uni-directional and bi-directional data rates in Kbytes/sec for each link 
speed; LinknSpecial is to be read as LinkOSpecial when selecting link speed and as Link123Special for 
the others. Data rates are quoted for a transputer using internal memory, and will be affected by a factor 
depending on the number of external memory accesses and the length of the external memory cycle. 
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Table 9.2 Speed Settings for Transputer Links 



Link 
Special 


Linkn 
Special 


Mbits/sec 


Kbytes/sec 


Unl 


Bi 





1 
1 




1 


1 


10 
5 
10 
20 


800 

430 

800 

1480 


1130 

590 

1130 

2050 





|Ack| 


H'H 1 2 3 4|5 6 7 L, 


1 Data 1 



Figure 9.1 IMS T222 link data and acknowledge packets 



Table 9.3 Link 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TJQr 


LinkOut rise time 






20 


ns 


1 


TJQf 


LinkOut fall time 






10 


ns 


1 


TJDr 


Linkin rise time 






20 


ns 


1 


TJDf 


Linkin fall time 






20 


ns 


1 


TJQJD 


Buffered edge delay 









ns 




TJBskew 


Variation in TJQJD 20 Mbits/s 






3 


ns 


2 




lOMbits/s 






10 


ns 


2 




5 Mbits/s 






30 


ns 


2 


CLIZ 


Linkin capacitance @ f=1 MHz 






7 


PF 


1 


CLL 


LinkOut load capacitance 






50 


PF 




RM 


Series resistor for lOOQ transmission line 




56 




ohms 





Notes 



1 These paramters are sampled, but are not 100% tested. 

2 This is the variation in the total delay through buffers, transmission lines, differential receivers etc., caused by 
such things as short term variation in supply voltages and differences in delays for rising and falling edges. 
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Figure 9.2 IMS T222 link timing 
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Figure 9.3 IMS T222 buffered link timing 
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Figure 9.4 IMS T222 Links directly connected 
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Figure 9.5 IMS T222 Links connected by transmission line 
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Figure 9.6 IMS T222 Links connected by buffers 



437 



10 Electrical specifications 
10.1 DC electrical characteristics 



Table 10.1 Absolute maximum ratings 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


vcc 


DC supply voltage 





7.0 


V 


1.2,3 


VI, vo 


Voltage on input and output pins 


-0.5 


VCC4-0.5 


V 


1,2,3 


II 


Input current 




±25 


mA 


4 


OSCT 


Output short circuit time (one pin) 




1 


s 


2 


TS 


Storage temperature 


-65 


150 


°C 


2 


TA 


Ambient temperature under bias 


-55 


125 


°C 


2 


PDmax 


Maximum allowable dissipation 




2 


W 





Notes 



1 All voltages are with respect to GND. 

2 This Is a stress rating only and functional operation of the device at these or any other conditions beyond those 
indicated in the operating sections of this specification is not implied. Stresses greater than those listed may 
cause permanent damage to the device. Exposure to absolute maximum rating conditions for extended periods 
may affect reliability. 

3 This device contains circuitry to protect the inputs against damage caused by high static voltages or electrical 
fields. However, it Is advised that normal precautions be taken to avoid application of any voltage higher than the 
absolute maximum rated voltages to this high impedance circuit. Unused inputs should be tied to an appropriate 
logic level such as VCC or GND. 

4 The input current applies to any input or output pin and applies when the voltage on the pin is between GND 
and VCC. 



Table 10.2 Operating conditions 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VCC 


DC supply voltage 


4.75 


5.25 


V 


1 


VI, vo 


Input or output voltage 





VCC 


V 


1,2 


CL 


Load capacitance on any pin 




60 


PF 




TA 


Operating temperature range IMS T222-S 





70 


°C 


3 


TA 


Operating temperature range IMS T222-M 


-55 


125 


°C 


3 



Notes 



1 All voltages are with respect to GND. 

2 Excursions beyond the supplies are permitted but not recommended; see DC characteristics, 

3 Air flow rate 400 linear ft/min transverse air flow. 
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Table 10.3 DC characteristics 



SYIVIBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VIH 


High level input voltage 


2.0 


VCC+0.5 


V 


1.2 


VIL 


Low level input voltage 


-0.5 


0.8 


V 


1.2 


II 


Input current @ GND<VI<VCC 




±10 


mA 


1,2 


VOH 


Output high voltage @ IOH=2mA 


VCC-1 




V 


1.2 


VOL 


Output low voltage @ IOL=4mA 




0.4 


V 


1.2 


lOS 


Output short circuit current @ GND<VO<VCC 


36 


65 


mA 


1.2,3,6 






65 


100 


mA 


1,2,4,6 


lOZ 


Tristate output current @ GND<VO<VCC 




±10 


mA 


1.2 


PD 


Power dissipation 




700 


mW 


2.5 


CIN 


Input capacitance @ f=1MHz 




7 


PF 


6 


COZ 


Output capacitance @ f=1 MHz 




10 


PF 


6 



Notes 



1 All voltages are with respect to GND. 

2 Parameters for IMS T222-S measured at 4.75V<VCC<5.25V and 0°C<TA<70°C. 
Input clock frequency = 5 MHz. 

3 Current sourced from non-link outputs. 

4 Current sourced from link outputs. 

5 Power dissipation varies with output loading and program execution. 

6 This parameter is sampled and not 100% tested. 



10.2 Equivalent circuits 



Output 



50pF 4s I I R2 
GND - 



VCC 



R1 



-M- 



Load for: 


R1 


R2 


Equivalent load: 


Link outputs 
Other outputs 


1K96 
970R 


47K 
24K 


1 Schottky TTL input 

2 Schottky 1 IL inputs 



Diodes are 1 N91 6 



Figure 10.1 Load circuit for AC measurements 
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Figure 10.2 AC measurements timing waveforms 



Test point 
Output under test 



"X 




vcc 



510R 



GND ■ 1— 



Figure 10.3 Tristate load circuit for AC measurements 
10.3 AC timing cinaracteristics 

Table 10.4 Input, output edges 



SYIVIBOL 


PARAfVIETER 


IVIIN 


IVIAX 


UNITS 


NOTE 


TDr 
TDf 
TQr 
TQf 


Input rising edges 
Input falling edges 
Output rising edges 
Output falling edges 


2 
2 


20 
20 
25 
15 


ns 
ns 
ns 
ns 


1.2 
1,2 

1 

1 



Notes 



1 Non-link pins; see section on links. 

2 All inputs except Clockln; see section on Clockln. 
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Figure 10.4 IMS T222 input and output edge timing 
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ns 
20- 
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1 1 I i 1 1 i 1 
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Load Capacitance pF 




Load Capacitance pF 




Link 




ElVII 



Notes 



Figure 10.5 Typical rise/fall times 



1 Skew is measured between notMemCE witli a standard load (2 Schottky TTL inputs and SOpF) and 
notMemCE with a load of 2 Schottky TTL inputs and varying capacitance. 
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10.4 Power rating 

Internal power dissipation Pj^t of transputer and periplneral chips depends on VCC, as shown in figure 10.6. 
Pint 'S substantially independent of temperature. 

Total power dissipation Pd of the chip is 

Pd = Pint + Pio 

where Pjo is the power dissipation In the input and output pins; this is application dependent. 

Internal working temperature Tj of the chip is 

Tj^TA+eJA*PD 

where Ta is the external ambient temperature in °C and SJa is the junction-to-ambient thermal resistance in 
°C/W. OJa for each package is given in the Packaging Specifications section. 



500- 
Power 
PINT 400 J 

mW 



300" 




-T — I — I — I — I — I — I — I — I — I — I — I — I — r 

4.4 4.6 4.8 5.0 5.2 5.4 5.6 
VCC Volts 



Figure 10.6 IMS T222 internal power dissipation vs VCC 
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11 



Performance 



The performance of the transputer is measured in terms of the number of bytes required for the program, and 
the number of (internal) processor cycles required to execute the program. The figures here relate to OCCam 
programs. For the same function, other languages should achieve approximately the same performance as 
Occam. 



11.1 



Performance overview 



These figures are averages obtained from detailed simulation, and should be used only as an initial guide; 
they assume operands are of type INT. The abbreviations in table 1 1 .1 are used to represent the quantities 
indicated. In the replicator section of the table, figures in braces {} are not necessary if the number of 
replications is a compile time constant. To estimate performance, add together the time for the variable 
references and the time for the operation. 



Table 11.1 Key to performance table 



np 

ne 

r 

ts 

w 

P 

Eg 

Et 

Tb 

Tbp 

Tbc 

nsp 

nap 



number of component processes 

number of processes earlier in queue 

1 if INT parameter or array parameter, if not 

number of table entries (table size) 

width of constant in nibbles 

number of places to shift 

expression used in a guard 

timer expression used in a guard 

most significant bit set of multiplier ((-1) if the multiplier is 0) 

most significant bit set in a positive multiplier when counting from zero ((-1) if the multiplier is 0) 

most significant bit set in the two's complement of a negative multiplier 

Number of scalar parameters in a procedure 

Number of array parameters in a procedure 
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Table 1 1 .2 Performance 





Size (bytes) 


Time (cycles) 


Names 

variables 

in expression 
assigned to or input to 
in PROC or FUNCTION call, 
corresponding to an INT parameter 

channels 


1.1 +r 
1.1 +r 

1.1 +r 
1.1 


2.1+2(r) 
1.1 +(r) 

1.1 +(r) 
2.1 


Array Variables (for single dimension arrays) 
constant subscript 
variable subscript 
expression subscript 




5.3 
5.3 




7.3 
7.3 


Declarations 

CHAN OF protocol 
[size] CHAN OF protocol 
PROC 


3.1 

9.4 

body+2 


3.1 

2.2 + 20.2*size 




Primitives 

assignment 

input 

output 

STOP 

SKIP 



4 
1 
2 





26.5 

26 

25 




Arithmetic operators 
+ 

fr 
/ 

REM 

» « 


1 
2 
2 
2 
2 


1 

23 
24 
22 
3+p 


Modulo Arithmetic operators 
PLUS 
MINUS 
TIMES (fast multiply) 


2 

1 

1 


2 

1 

4+Tb 


Boolean operators 
OR 
AND NOT 


4 
1 


8 
2 


Comparison operators 
= constant 
= variable 
<> constant 
<> variable 
> < 
>= <= 



2 

1 
3 
1 
2 


1 
3 
3 
5 
2 
4 


Bit operators 

A \/ X - 


2 


2 


Expressions 

constant in expression 
check If error 


w 

4 


w 

6 
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Table 1 1 .3 Performance 





Size (bytes) 


Time (cycles) 


Timers 






timer input 


2 


3 


timer AFTER 






if past time 


2 


4 


with empty timer queue 


2 


31 


non-empty timer queue 


2 


38+ne*9 


ALT (timer) 






with empty timer queue 


6 


52 


non-empty timer queue 


6 


59+ne*9 


timer alt guard 


8+2Eg+2Et 


34+2Eg+2Et 


Constructs 






SEQ 








IF 


1.3 


1.4 


if guard 


3 


4.3 


ALT (non timer) 


6 


26 


alt channel guard 


10.2+2Eg 


20+2Eg 


skip alt guard 


8+2Eg 


10+2Eg 


PAR 


11.5+(np-1)*7.5 


19.5+(np-1)*30.5 


WHILE 


4 


12 


Procedure or function call 








3.5+(nsp-2)*1.1 


16.5+(nsp-2)*1.1 




+nap*2.3 


+nap*2.3 


Replicators 






replicated SEQ 


7.3{+5.1} 


(-3.8)+15.1*count{+7.1} 


replicated IF 


12.3{+5.1} 


(-2.6)+19.4*count{+7.1} 


replicated ALT 


24.8{+10.2} 


25.4+33.4*count{+14.2} 


replicated timer alt 


24.8{+10.2} 


62.4+33.4*count{+14.2} 


replicated PAR 


39.1 {+5.1} 


(-6.4)+70.9*count{+7.1} 



1 1 .2 Fast multiply, times 

The IMS T222 has a fast integer multiplication instruction product. The time taken for a fast multiply is 4+Tb. 
The time taken for a multiplication by zero is 3 cycles. For example, if the multiplier is 1 the time taken is 
4 cycles, if the multiplier is -1 (all bits set) the time taken is 19 cycles. 

Implementations of high level languages on the transputer may take advantage of this instruction. For example, 
the Occam modulo arithmetic operator times is implemented by the instruction and the right-hand operand is 
treated as the multiplier. The fast multiplication instruction is also used in high level language implementations 
for the multiplication implicit in multi-dimensional array access. 



11.3 



Aritlimetic 



A set of functions are provided within the development system to support the efficient implementation of 
multiple length integer arithmetic and floating point arithmetic where relevant. In table 1 1 .4 n gives the number 
of places shifted and all arguments and results are assumed to be local. Full details of these functions are 
provided in the OCCam reference manual, supplied as part of the development system and available as a 
separate publication. 

When calculating the execution time of the predefined maths functions, no time needs to be added for calling 
overhead. These functions are compiled directly into special purpose instructions which are designed to 
support the efficient implementation of multiple length integer arithmetic and floating point arithmetic. 
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Table 1 1 .4 Arithmetic performance 



Function 


Cycles 


+ cycles for 
parameter access 


t 


LONGADD 




2 


7 




LONGSUM 




3 


8 




LONGSUB 




2 


7 




LONGDIFF 




3 


8 




LONGPROD 




18 


8 




LONGDIV 




20 


8 




SHIFTRIGHT 

SHIFTLEFT 

NORMALISE 


(n<16) 

(n>=16) 

(n<16) 

(n>=16) 

(n<16) 

(n>=16) 


4+n 

n-11 
4+n 
n-11 
n+6 
n-9 


8 
8 
8 
8 
7 
7 




ASHIFTRIGHT 


(n=32) 


4 
SHIFTRIGHT+2 


7 
5 




ASHIFTLEFT 




SHIFTLEFT+4 


5 




ROTATERIGHT 




SHIFTRIGHT 


7 




ROTATELEFT 




SHIFTLEFT 


7 





t Assuming local variables. 



11.4 Floating point operations 



Floating point operations for the IMS T222 are provided by a run-time package. This requires approximately 
2000 bytes of memory for the double length arithmetic operations, and 2500 bytes for the quadruple length 
arithmetic operations. Table 1 1.5 summarizes the estimated performance of the package. 



Table 1 1 .5 IMS T222 floating point operations performance 





Processor cycles 


IMS T222 


Typical 


Worst 


REAL32 + - 

* 

/ 

<> = >=<=<> 

REAL64 + - 

* 

/ 


530 

650 

1000 

60 

875 

1490 

2355 

60 


705 

705 

1410 

60 

1190 

1950 

3255 

60 
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11.5 Effect of external memory 

Extra processor cycles may be needed when program and/or data are held in external memory, depending 
both on the operation being performed, and on the speed of the external memory After a processor cycle 
which initiates a write to memory, the processor continues execution at full speed until at least the next 
memory access. 

Whilst a reasonable estimate may be made of the effect of external memory the actual performance will 
depend upon the exact nature of the given sequence of operations. 

External memory Is characterized by the number of extra processor cycles per external memory cycle, denoted 
as e. The value of e for the IMS T222 with no wait states is 1 . 

If a program is stored in external memory, and e has the value 2 or 3, then no extra cycles need be estimated 
for linear code sequences. For larger values of e, the number of extra cycles required for linear code 
sequences may be estimated at (2e-1)/4 per byte of program. A transfer of control may be estimated as 
requiring e+3 cycles. 

These estimates may be refined for various constructs. In table 11 .6 n denotes the number of components in 
a construct. In the case of IF, the n'th conditional Is the first to evaluate to TRUE, and the costs include the 
costs of the conditionals tested. The number of bytes in an array assignment or communication is denoted 
byb. 

Table 1 1 .6 External memory performance 





IMS T222 1 


Program off chip 


Data off chip 


Boolean expressions 


e-1 





IF 


3en-1 


en 


Replicated IF 


6en+9e-12 


(5e-2)n+6 


Replicated SEQ 


(4e-3)n+3e 


(4e-2)n+3-e 


PAR 


4en 


3en 


Replicated PAR 


(17e-12)n+9 


16en 


ALT 


(4e-1)n+9e-4 


(4e-1)n+9e-3 


Array assignment and 





max (2e, eb) 


communication in 






one transputer 







The following simulation results illustrate the effect of storing program and/or data in external memory. The 
results are normalized to 1 for both program and data on chip. The first program (Sieve of Erastosthenes) 
is an extreme case as it Is dominated by small, data access intensive loops; it contains no concurrency, 
communication, or even multiplication or division. The second program is the pipeline algorithm for Newton 
Raphson square root computation. 
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Table 11.7 IMS T222 external memory performance 





Program 


e=1 


8=2 


e=3 


6=4 


On chip 


Program off chip 


1 
2 


1.2 
1.1 


1.4 
1.2 


1.8 
1.4 


2.1 
1.6 




Data off chip 


1 
2 


1.2 
1.1 


1.5 
1.3 


1.8 
1.4 


2.1 
1.6 




Program and data off chip 


1 
2 


1.4 
1.2 


1.9 
1.5 


2.5 
1.8 


3.0 
2.1 





11.6 Interrupt latency 

If the process is a high priority one and no other high priority process is running, the latency is as described 
in table 1 1 .8. The timings given are in full processor cycles TPCLPCL; the number of Tm states is also given 
where relevant. Maximum latency assumes all memory accesses are internal ones. 



Table 1 1 .8 Interrupt latency 





Typical 


Maximum 


TPCLPCL 


Tm 


TPCLPCL 


Tm 


IMS T222 


19 




53 
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12 Package specifications 

12.1 68 pin grid array package 
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Top 


View 



■v^>-s 



y 




GND 



hiTY^ 



Mem 
WrBI 



ot 

Mem 
WrBO 




Figure 12.1 IMS T222 68 pin grid array package pinout 
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Figure 12.2 68 pin grid array package dimensions 



Table 12.1 68 pin grid array package dimensions 



DIIVI 


r\/lillimetres 


Indies 


Notes 


NOIVI 


TOL 


NOIVI 


TOL 


A 


26.924 


±0.254 


1.060 


±0.010 




B 


17.019 


±0.127 


0.670 


±0.008 




C 


2.466 


±0.279 


0.097 


±0.011 




D 


4.572 


±0.127 


0.180 


±0.005 




E 


3.302 


±0.127 


0.130 


±0.005 




F 


0.457 


±0.051 


0.018 


±0.002 


Pin diameter 


G 


1.270 


±0.127 


0.050 


±0.005 


Flange diameter 


K 


22.860 


±0.127 


0.900 


±0.005 




L 


2.540 


±0.127 


0.100 


±0.005 




M 


0.508 




0.020 




Chamfer 



Package weight is approximately 6.8 grams 

Table 12.2 68 pin grid array package junction to ambient thermal resistance 



SYIVIBOL 


PARAIVIETER 


IVIIN 


NOIVI 


MAX 


UNITS 


NOTE 


(9JA 


At 400 linear ft/min transverse air flow 






35 


°c/w 
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12.2 68 pin PLCC J-bend package 
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Figure 12.3 IMS T222 68 pin PLCC J-bend package pinout 
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Figure 12.4 68 pin PLCC J-bend package dimensions 



Table 12.3 68 pin PLCC J-bend package dimensions 



DIIVI 


Millimetres 


Inches 


Notes 


NOM 


TOL 


NOM 


TOL 


A 


25.146 


±0.127 


0.990 


±0.005 




B 


24.232 


±0.127 


0.954 


±0.005 




C 


3.810 


±0.127 


0.150 


±0.005 




D 


0.508 


±0.127 


0.020 


±0.005 




F 


1.270 


±0.127 


0.050 


±0.005 




G 


0.457 


±0.127 


0.018 


±0.005 




J 


0.000 


±0.051 


0.000 


±0.002 




K 


0.457 


±0.127 


0.018 


±0.005 




L 


0.762 


±0.127 


0.030 


±0.005 





Package weight is approximately 5.0 grams 

Table 12.4 68 pin PLCC J-bend package junction to ambient thermal resistance 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


^JA 


At 400 linear ft/min transverse air flow 




35 




°c/w 





452 



13 Ordering 

This section indicates the designation of speed and package selections for the various devices. Speed of 
Clockln is 5 MHz for all parts. Transputer processor cycle time is nominal; it can be calculated more exactly 
using the phase lock loop factor PLLx, as detailed in the external memory section. 

For availability contact local INMOS sales office or authorised distributor. 



Table 13.1 IMS T222 ordering details 



INMOS 
designation 


Processor 
clock speed 


Processor 
cycle time 


PLLx 


Package 


IMS T222-G17S 
IMS T222-G20S 


17.5 MHz 
20.0 MHz 


57 ns 
50 ns 


3.5 
4.0 


Ceramic Pin Grid 
Ceramic Pin Grid 


IMS T222-J17S 
IMS T222-J20S 


17.5 MHz 
20.0 MHz 


57 ns 
50 ns 


3.5 
4.0 


Plastic J-Bend 
Plastic J-Bend 


IMS T222-G17M 
IMS T222-G20M 


17.5 MHz 
20.0 MHz 


57 ns 
50 ns 


3.5 
4.0 


Ceramic Pin Grid MIL Spec 
Ceramic Pin Grid MIL Spec 
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Product Preview 



FEATURES 

1 6 bit architecture 

33 ns internal cycle time 

30 MIPS (peak) instruction rate 

IMS T225-20 is pin compatible with IMS T222-20 

Debugging support 

4 Kbytes on-chip static RAM 

60 Mbytes/sec sustained data rate to internal memory 

64 Kbytes directly addressable external memory 

30 Mbytes/sec sustained data rate to external memory 

630 ns response to interrupts 

Four INMOS serial links 5/10/20 Mbits/sec 

Bi-directional data rate of 2.4 Mbytes/sec per link 

Internal timers of 1 /iS and 64 //s 

Boot from ROM or communication links 

Single 5 MHz clock input 

Single +5V ±5% power supply 

MIL-STD-883C processing will be available 

APPLICATIONS 

Real time processing 

Microprocessor applications 

High speed multi processor systems 

Industrial control 

Robotics 

System simulation 

Digital signal processing 

Telecommunications 

Fault tolerant systems 

Medical instrumentation 
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1 Introduction 

The IMS T225 transputer is a 16 bit CMOS microcomputer with 4 Kbytes on-chip RAM for high speed 
processing, an external memory interface and four standard INMOS communication links. The instruction set 
achieves efficient Implementation of high level languages and provides direct support for the Occam model 
of concurrency when using either a single transputer or a network. Procedure calls, process switching and 
typical interrupt latency are sub-microsecond. A device running at 30 MHz achieves an instruction throughput 
of 15 MIPS. 



For convenience of description, the IMS T225 operation is split into the basic blocks shown in figure 1.1. 
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Error - 
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Figure 1.1 IMS T225 block diagram 

The IMS T225 is functionally equivalent to the IMS T222 but has the addition of three speed select pins 
(ProcSpeedSelectO-2) and improved links. The IMS T225 is pin compatible with the IMS T222 and is 
a direct replacement in many applications. The IMS T225 can directly access a linear address space of 
64 Kbytes. The 16 bit wide non-multiplexed external memory interface provides a data rate of up to 2 bytes 
every 100 nanoseconds (20 Mbytes/sec) for a 20 MHz device. 

System Services include processor reset and bootstrap control, together with facilities for error analysis. 



The INMOS communication links allow networks of transputers to be constructed by direct point to point 
connections with no external logic. The links support the standard operating speed of 10 Mbits/sec, but also 
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operate at 5 or 20 Mbits/sec. The links have been Improved over those of the IMS T222 and fully support 
overlapped acknowledge; each IMS T225 link can transfer data bi-directionally at up to 2.4 Mbytes/sec. The 
link speed settings are the same as those on the IMS T800 (see page 241). 

The IMS T225 instruction set contains a number of instructions to facilitate the implementation of breakpoints. 
For further information concerning breakpointing, refer to Support for debugging/breakpointing in transputers 
(technical note 61). 
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Pin designations 



Table 2.1 IMS T225 system services 



Pin 


In/Out 


Function 


VCC, GND 




Power supply and return 


CapPlus, CapMinus 




External capacitor for internal clock power supply 


Clockln 


in 


Input clock 


ProcSpeedSelectO-2 


in 


Processor speed selectors 


Reset 


in 


System reset 


Error 


out 


Error indicator 


Analyse 


in 


Error analysis 


BootFromRom 


in 


Bootstraps from external ROM or from link 


DIsablelntRAM 


in 


Disable internal RAM 



Table 2.2 IMS T225 external memory interface 



Pin 


In/Out 


Function 


ProcClockOut 


out 


Processor clock 


MemAO-15 


out 


Sixteen address lines 


MemDO-15 


in/out 


Sixteen data lines 


notMemWrBO-1 


out 


Two byte-addressing write strobes 


notMemCE 


out 


Chip enable 


Mem B Ace 


in 


Byte access mode selector 


MemWait 


in 


Memory cycle extender 


MemReq 


in 


Direct memory access request 


Mem Granted 


out 


Direct memory access granted 



Table 2.3 IMS T225 event 



Pin 


In/Out 


Function 


EventReq 
EventAck 


in 
out 


Event request 

Event request acknowledge 



Table 2.4 IMS T225 link 



Pin 


In/Out 


Function 


LlnklnO-3 


in 


Four serial data input channels 


LinkOutO-3 


out 


Four serial data output channels 


LinkSpecial 


in 


Select non-standard speed as 5 or 20 Mbits/sec 


LInkOSpeclal 


in 


Select special speed for Link 


Llnk123Special 


in 


Select special speed for Links 1 ,2,3 



Signal names are prefixed by not if they are active low, otherwise they are active high. 
Pinout details for various packages are given on page 459. 
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3 Instruction set summary 

The invStruction set of the IMS T225 is the same as that of the IMS T222 with a number of additions. The 
instructions additional to those of the IMS T222 are listed below. 

The load device identity (Iddevid) instruction (table 3.4) pushes the device type identity Into the A register. 
Each product is allocated a unique group of numbers for use with the /c/dew'cf instruction. The product identity 
numbers for the IMS T225 are 40 to 49 inclusive. 

Table 3.5 contains a number of instructions to facilitate the implementation of breakpoints. These instructions 
overload the operation of JO. Normally yO is a no-op which might cause descheduling. Se/yot>rea/c enables the 
breakpointing facilities and causes ;'(? to act as a breakpointing instruction. When breakpointing is enabled, 
jO swaps the current Iptr and Wptr with an Iptr and Wptr stored above MemStart. The breakpoint instruction 
does not cause descheduling, and preserves the state of the registers. It is possible to single step the pro- 
cessor at machine level using these instructions. Refer to Support for debugging/breakpointing in transputers 
(technical note 61) for more detailed information regarding debugger support. 

Table 3.1 IMS T225 arithmetic/logical operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


08 


F8 


prod 


b+4 
m+5 


product for positive register A 
product for negative register A 





Table 3.2 IMS T225 general operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


5A 
79 


^5FA 
27F9 


dup 
pop 


1 
1 


duplicate top of stack 
pop processor stack 





Table 3.3 IMS T225 ORG and bit operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


74 
75 

76 
77 
78 


27F4 
27F5 

27F6 
27F7 
27F8 


crcword 
crcbyte 

bitcnt 

bitrevword 

bitrevnbits 


35 

11 

b+2 

36 
n+4 


calculate crc on word 
calculate crc on byte 

count bits set in word 
reverse bits in word 
reverse bottom n bits in word 





Table 3.4 IMS T225 processor initialisation operation codes 



Operation 
Code 


Memory 
Code 


Mnemonic 


Processor 
Cycles 


Name 


D 

E 


17C 
7E 


2127FC 
27FE 


Iddevid 
Idmemstartval 


1 
1 


load device identity 

load value of memstart address 
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Table 3.5 IMS T225 debugger support codes 



Operation 


Memory 




Processor 




D 


Code 


Code 


Mnemonic 


Cycles 


Name 


E 





00 


jumpO 


3 

11 
13 


jump (break not enabled) 

jump (break enabled, high priority) 

jump (break enabled, low priority) 


D 


B1 


2BF1 


break 


9 

11 


break (high priority) 
break (low priority) 




B2 


2BF2 


cIrjObreak 


1 


clear jump break enable flag 




B3 


2BF3 


setjObreak 


1 


set jump break enable flag 




84 


2BF4 


testjObreak 


2 


test jump break enable flag set 




7A 


27FA 


timerdisableh 


1 


disable high priority timer interrupt 




7B 


27FB 


timerdisablel 


1 


disable low priority timer interrupt 




70 


27FG 


timerenableh 


6 


enable high priority timer interrupt 




ID 


27FD 


timerenablel 


6 


enable low priority timer interrupt 
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4 Package specifications 

4.1 68 pin grid array package 
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Figure 4.1 IMS T225 68 pin grid array package pinout 
Details of the 68 pin grid array package dimensions are given on page 449 
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4.2 



68 pin PLCC J-bend package 
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Figure 4.2 IMS T225 68 pin PLCC J-bend package pinout 
Details of the 68 pin PLCC J-bend package dimensions are given on page 451. 
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5 Ordering 

This section indicates the designation of speed and package selections for the various devices. Speed of 
Clockln is 5 MHz for all parts. Transputer processor cycle time is nominal; it can be calculated more exactly 
using the phase lock loop factor PLLx, as detailed in the external memory section. 

For availability contact local INMOS sales office or authorised distributor. 



Table 5.1 IMS T225 ordering details 



INMOS 
designation 


Processor 
clock speed 


Processor 
cycle time 


PLLx 


Package 


IIVIS T225-G17S 
llViS T225-G20S 
IIVIS T225-G25S 
IMS T225-G30S 

IMS T225-J17S 
IMS T225-J20S 


17.5 MHz 
20.0 MHz 
25.0 MHz 
30.0 MHz 

17.5 MHz 
20.0 MHz 


57 ns 
50 ns 
40 ns 
33 ns 

57 ns 
50 ns 


3.5 
4.0 
5.0 
6.0 

3.5 
4.0 


Ceramic Pin Grid 
Ceramic Pin Grid 
Ceramic Pin Grid 
Ceramic Pin Grid 

Plastic J-Bend 
Plastic J-Bend 
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On-chip 16 bit processor 
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Bootstraps from ROM, link or disk 

Single 5 MHz processor clock input 
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Introduction 



The IMS M212 peripheral processor is an intelligent peripheral controller of the INMOS transputer family, 
configured for connection to soft sectored Winchester and floppy disk drives. It satisfies the demand for 
increasing intelligence in peripheral controllers and maintains a high degree of flexibility, allowing designers 
to modify the controller function without altering the hardware. 
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Figure 1 .1 IMS M212 block diagram 



The disk control function has been designed to provide easy connection, with minimal external hardware, 
to a standard Winchester and/or floppy disk interface. Two byte-wide programmable bidirectional ports are 
provided to control and monitor disk functions such as head position, drive selection and disk status. A 
dedicated port is provided for serial data interfaces and critical timing signals. 
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The IMS M212 is programmed as a normal transputer, permitting extremely powerful peripheral control facil- 
ities to be built into the device and thus reducing the load on the traditional central processor of a computer. 
Full details are given in the IMS M212 Disk Processor Product Data manual. 

1.1 IMS M212 peripheral processor 

1.1.1 Central processor 

At the heart of the IMS M212 is a 16 bit processor which is compatible with the transputer family. Its design 
achieves compact programs, efficient high level language implementation and provides direct support for the 
Occam model of concurrency. The processor shares its time between any number of concurrent processes. 
A process waiting for communication or a timer does not consume any processor time. Two levels of process 
priority enable fast interrupt response to be achieved. 

The IMS M212 has been designed so that the on-chip processor performs as many functions as possible, 
providing flexible operation and minimising on-chip disk-specific hardware. 

1.1.2 Peripheral interface 

The two 8 bit data ports PAO-7 and PBO-7 are controlled by the processor via a pair of channels. This allows 
the programmer to modify the function of these ports in order to implement a wide variety of applications. 

The peripheral interface includes data output registers and TTL compatible input ports, as well as facilities 
for defining the direction of the pins on a bit-selectable basis. The interface contains logic to detect a change 
of state on the input pins and to store this change for interrogation by the program. 

In addition to this, the external memory interface can support memory mapped peripherals on its byte-wide 
data bus. An event pin is also provided, so that peripherals can request attention. 

1.1.3 Disk controller 

The disk interface provides a simple interconnection to ST506/ST412 and SA400/SA450 compatible disk 
drives via ten dedicated disk control lines and the two general purpose 8 bit bidirectional data ports PAO-7 
and PBO-7. Although the on-chip disk control hardware handles much of the specialised data conversion, 
as many disk operations as possible are controlled by the processor, using sequences of control and data 
information. 

The processor can program and interrogate all the registers controlling the disk functions and data ports, and 
thereby control the external interface lines. As a result of this versatility, the IMS M212 can also be used in 
applications other than disk control ones. 

A versatile hardware 32 bit Error Correcting Codes (ECC) and 16 bit Cyclic Redundancy Codes (CRC) 
generator is included to check data integrity. ECC's allow certain classes of errors to be corrected as well as 
detected, whilst CRC's only allow detection. 

When writing data to the disk the hardware serialises the data and encodes it into a Frequency Modulated 
(FM) or Modified Frequency Modulated (MFM) data stream. Any necessary precompensation is performed 
internally before outputting the data together with the necessary control signals. Any necessary modification 
of the data, for instance writing the Address Marks (AM) or inserting the CRC/ECC bytes, is automatically 
performed by the hardware. 

When reading data from disk the raw read data is input and the function known as data separation is performed 
internally. The hardware examines the data stream for an Address Mark to achieve byte synchronisation and 
then searches for the desired sector information. When the required data is located it is decoded and a serial 
to parallel conversion is performed before the data is transferred to the processor. 
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1.1.4 Links 

The IMS M212 uses a DMA block transfer mechanism to transfer messages between memory and another 
transputer product via the INMOS links. The link interfaces and the processor ail operate concurrently, 
allowing processing to continue while data is being transferred on all of the links. 

The host interface of the IMS M212 is via two INMOS standard links, providing simple connection to any 
transputer based system or, via a link adaptor, to a conventional microprocessor system. Link speeds of 
10 Mbits/sec and 20 Mbits/sec are available, making the device compatible with all other INMOS transputer 
products. 

The on-chip disk control logic is controlled by the processor, using simple command sequences, via two 
channels which appear to the processor as a normal pair of hardware channels. 

1.1.5 IVIemory system 

The 2 Kbytes of on-chip static RAM can be used for program or data storage, as a sector buffer or to store 
parameter and format information. It can be extended off chip, via the external memory interface, to provide 
a total of 64 Kbytes. Internal and external memory appear as a single contiguous address space. 

Software contained in 4 Kbytes of internal ROM enables the IMS M212 to be used as a stand alone disk 
processor. The ROM can be disabled to free the address space for external memory. 

1.1.6 Error handling 

High level language execution is made secure with array bounds checking, arithmetic overflow detection etc. 
A flag is set when an error is detected, and the error can be handled internally by software or externally by 
sensing the error pin. System state is preserved for subsequent analysis. 
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2 Operation 

The IMS M212 can be used in two modes: Mode 1 , which uses the software in the internal ROM, and Mode 2, 
which relies upon custom designed software. 

2.1 Mode 1 

Mode 1 operation uses code in the on-chip ROM to control the disk controller hardware, and little knowledge 
of the hardware is required to implement Winchester and floppy disk drivers. The programming interface to 
all drive types is identical, and there is sufficient flexibility to allow a wide variety of formats and drive types 
to be used. 

Both ST506/412 compatible Winchester and SA400/450 compatible floppy drives are supported in standard 
double density formats; this includes common 5.25 and 3.5 inch drives. Up to 4096 cylinders are allowed. 
Floppy drives can have up to 8 heads and Winchesters up to 16 heads. There can be between 1 and 256 
sectors per track, with sector sizes of 1 28 to 1 6384 bytes in powers of 2. Drives with or without 'seek complete' 
and 'ready' lines are supported, and step rates can be from 64 //s to 16 ms. A range of non-standard formats 
can also be set up for user-specific requirements. 

As with transputers, the IMS M212 can be bootstrapped from ROM or via a link. In addition, the Mode 1 
monitor process also provides a facility whereby the disk processor can bootstrap itself with code read from 
a disk; this code runs instead of the Mode 1 process. Another option sends a standard bootstrap message, 
read from a disk, out of link 0; the Mode 1 process then continues as normal. It is also possible in Mode 1 
to send a command, at any time, to bootstrap from code in the sector buffer. 

General workspace for Mode 1 is contained in on-chip RAM, which also provides 1280 bytes of sector buffer. 
Contiguous external RAM immediately past the internal RAM will automatically be used to extend the size of 
the sector buffer. As many sectors as will fit into the sector buffer can be stored in it at the same time. 

In Mode 1 a separate data area, in on-chip RAM, contains all the required control information (parameters) 
for each of the four possible drives. Parameters may be read from or written to via the links, and contain such 
information as the capacity of the disk, current position of the heads, desired sector for reading or writing, 
drive type, timing details etc. 

Command and data bytes are accepted down either of the IMS M212 links; an interlock system prevents 
conflict between commands received on both links simultaneously. Any results are returned on the link which 
received the command. Available commands are 



EndOfSequence 


Initialise 


ReadParameter 


Write Parameter 


ReadBuffer 


WriteBuffer 


ReadSector 


WriteSector 


Restore 


Seek 


SelectHead 


SelectDrive 


PoHDrives 


FormatTrack 


Boot 





Disk access commands implicitly select the drive, perform a seek and select the head. If an ECC or CRC 
error is found when reading a sector, a programmable number of automatic retries are performed and a 
subsequent correction attempted if possible. Mode 1 supports two of the four IMS M212 ECC/CRC modes - 
ECC and CRC. Either CRC or ECC can be specified in either of the ID or Data fields, making it possible to 
have floppies with correctable Data fields. 

All appropriate parameters are checked to ensure that, for example, an attempt is not made to access a 
non-existent sector, relieving the host processor of such checking. Another feature which reduces the load 
on the host processor is the logical sector mode, in which all the sectors are specified as a single linear 
address space rather than physical cylinder/head/sector. 

The logical address can also be auto-incremented if desired, as can the sector buffer. This allows a number 
of consecutive sectors to be read from or written to the disk with little overhead. As a sticky status checking 
technique is used, the status only has to be checked once at the end of a stream of commands; if an error 
occurred then reading and writing is inhibited, so that the logical address can be inspected to find where the 
error occurred. 



468 



11 IMS M21 2 preview 



2.2 



Mode 2 



In Mode 2 operation the internal ROM is bypassed, allowing the device to utilise user-defined software. This 
software can be held in external ROM, bootstrapped from a floppy or Winchester disk, or loaded fron^i the 
host processor via a link into internal or external RAM. 

In this mode the user services the disk control hardware via a pair of on-chip high bandwidth channels. Using 
these channels the processor has access to the 49 registers which control the operation of the disk controller. 
Sequences of control codes and data bytes are sent by the processor to the disk controller logic via one of 
the hardware channels and data returned to the IMS M212 processor via the other. Each control code is a 
single byte, and may be followed by one or more data bytes. 

In Mode 2 the designer can define new commands which are more complex than othen/vise available. Exam- 
ples include a Format Disk command as an extension to the Format Track; an application-specific directory 
structure; a software interface to optimise a particular file structure. Mode 2 also allows the user to optimise 
data transfer; thus, data could be read from a disk with no interleave, or data transfers could be re-ordered 
to minimise head movement. Disk searches can be arranged such that data transfer back to the host is 
minimised, as data comparisons can be performed by the on-chip processor. 
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Figure 2.1 Disk controller interface 
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Figure 3.1 Winchester disk controller 
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The IMS M212 can interface to a floppy or Winchester disk with very little external circuitry when used in 
Mode 1 or if a program is bootstrapped fronri a link. A typical arrangement is shown in figure 3.1. Note the 
absence of any control port buffers; this is possible provided the drive characteristics are not infringed. 

Additional external memory can easily be added to the IMS M212. In both Modes 1 and 2, external RAM can 
be added for extra sector storage, whilst in Mode 2 extra RAM or ROM can be provided for program storage. 

With the addition of control buffers and suitable clocks, a single IMS M212 can interface to both floppy and 
Winchester drives. Link adaptors provide a means of interfacing to conventional microprocessors. 



The IMS BOOS evaluation board is an example of an application with control for both types of drive, 
board also has a fully populated memory interface. 
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Figure 3.2 Enhanced disk controller interface 



The IMS M212 can interface with both floppy and Winchester disk drives, and the data rate to and from the 
disk can be selected by software. As a result the device is suitable for interfacing to the new generation of 
floppy disk drives which use vertical recording. These drives have an increased data rate of 1 Mbit/sec, and 
quadruple the capacity of existing floppy disk drives to 4 Mbytes. A single IMS M212 can be used to control 
a mixture of standard floppy drives, Winchester drives and the new high speed high capacity drives. This 
eases compatibility and portability problems, and provides a simple upgrade path from standard floppies to 
high capacity floppies to Winchesters. 



3 Applications 
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The IMS M212 provides a very simple and compact disk controller solution, making it very easy to replace 
a single large disk drive with an array of IMS M212's, each controlling a single smaller disk drive. This 
has several advantages: cheaper drives can be used; overall available disk bandwidth is increased; local 
processing is provided by a high performance processor at each disk node; fault tolerant operation. The 
latter can be achieved by holding duplicated data on several drives. This prevents the whole system from 
stopping, as would be the case if the single large drive failed. 

These advantages are particularly applicable when transputers are connected in arrays to provide high per- 
formance concurrent systems (figure 3.3). The IMS M212's can be directly connected to the array via INMOS 
links and the spare link used to communicate with the adjacent IMS M212 to provide the fault tolerant oper- 
ation. 
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Figure 3.3 Transputer network with disk processors 

A high performance processor allows many operations to be performed locally to the disk. This not only 
frees the host processor for other work but also removes the need for large amounts of data to be needlessly 
transferred to the host. Operations which can be performed by the IMS M212 include: file management 
with directory management and pre-reading; data manipulation such as compression/de-compression and 
encryption/de-cryption; data search such as database key searching; performance optimisation such as head 
scheduling and cacheing. 

The IMS M212 external memory interface can be used to connect to memory mapped peripherals. One 
application of this is interfacing to a SCSI bus controller, permitting direct connection to the SCSI bus in a 
low part count system. The processor is used to control the SCSI bus controller and implements the required 
command interface, as well as controlling the disk or other peripheral. 

This arrangement allows floppy and Winchester disks to be simply connected to a SCSI bus. Because the 
command interface Is controlled by a process running in the IMS M212, any future command upgrades can 
easily be incorporated. 
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The design can be used both as a target and an initiator interface, again controlled by the process running 
in the IMS M212. It provides a means of implementing a link to SCSI interface, as well as a SCSI controlled 
disk. 
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Figure 3.4 SCSI Interface 
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4 Package specifications 

4.1 68 pin grid array paclcage 
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Figure 4.1 IMS M212 68 pin grid array package pinout 
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Figure 4.2 68 pin grid array package dimensions 



Table 4.1 68 pin grid array package dimensions 



DIM 


Millimetres 


Inches 


Notes 


NOM 


TOL 


NOM 


TOL 


A 


26.924 


±0.254 


1.060 


±0.010 




B 


17.019 


±0.127 


0.670 


±0.008 




C 


2.466 


±0.279 


0.097 


±0.011 




D 


4.572 


±0.127 


0.180 


±0.005 




E 


3.302 


±0.127 


0.130 


±0.005 




F 


0.457 


±0.051 


0.018 


±0.002 


Pin diameter 


G 


1.270 


±0.127 


0.050 


±0.005 


Flange diameter 


K 


22.860 


±0.127 


0.900 


±0.005 




L 


2.540 


±0.127 


0.100 


±0.005 




M 


0.508 




0.020 




Chamfer 



Package weight is approximately 6.8 grams 

Table 4.2 68 pin grid array package junction to ambient thermal resistance 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


^JA 


At 400 linear ft/min transverse air flow 






35 


°c/w 
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4.2 



68 pin PLCC J-bend package 
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3 53 EventAck 
1 52 HoldToGND 
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Figure 4.3 IMS M212 68 pin PLCC J-bend package pinout 
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Figure 4.4 68 pin PLCC J-bend package dimensions 



Table 4.3 68 pin PLCC J-bend package dimensions 



DIM 


Miilimetres 


inches 


Notes 


NOM 


TOL 


NOfVI 


TOL 


A 


25.146 


±0.127 


0.990 


±0.005 




B 


24.232 


±0.127 


0.954 


±0.005 




C 


3.810 


±0.127 


0.150 


±0.005 




D 


0.508 


±0.127 


0.020 


±0.005 




F 


1.270 


±0.127 


0.050 


±0.005 




G 


0.457 


±0.127 


0.018 


±0.005 




J 


0.000 


±0.051 


0.000 


±0.002 




K 


0.457 


±0.127 


0.018 


±0.005 




L 


0.762 


±0.127 


0.030 


±0.005 





Package weight is approximately 5.0 grams 

Table 4.4 68 pin PLCC J-bend package junction to ambient thermal resistance 



SYl^nSOL 


PARAiVIETER 


I^IN 


NOiVI 


MAX 


UNITS 


NOTE 


^JA At 400 linear ft/min transverse air flow 




35 




°c/w 
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5 Ordering 

This section indicates the designation of speed and package selections for the various devices. Speed of 
Clockln is 5 MHz for all parts. Transputer processor cycle time is nominal; it can be calculated more exactly 
using the phase lock loop factor PLLx, as detailed in the external memory section. 

For availability contact local INMOS sales office or authorised distributor. 



Table 5.1 IMS M212 ordering details 



INMOS 
designation 


Processor 
ciocl( speed 


Processor 
cycle time 


PLLx 


Package 


IIVIS M212-G15S 
IIVIS M212-G20S 

IIVIS M212-J15S 
MS [VI212.J20S 


15 MHz 
20 MHz 

15 MHz 
20 MHz 


67 ns 
50 ns 

67 ns 
50 ns 


3.0 
4.0 

3.0 
4.0 


Ceramic Pin Grid 
Ceramic Pin Grid 

Plastic PLCC J-Bend 
Plastic PLCC J-Bend 
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IMS C004 

programmable 
ink switch 



mmos 



Engineering Data 



FEATURES 

standard INMOS serial links 
32 way crossbar switch 
Regenerates input signal 
Cascadable to any depth 
No loss of signal integrity 
1 or 20 Mbits/sec operating speed 
Separate INMOS configuration link 
Single +5V ±5% power supply 
TTL and CMOS compatibility 
1W power dissipation 
Standard 84 pin ceramic PGA 
MIL-STD-883C device is available 

APPLICATIONS 

Programmable crossbar switch 

Component of larger switch 

Reconfigurable supercomputers 

Message routing system 

High speed multiprocessor systems 

Telecommunications 

Robotics 

Fault tolerant systems 

Additional links for transputers 
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1 Introduction 

The INMOS communication link is a high speed system interconnect which provides full duplex communication 
between members of the INMOS transputer family, according to the INMOS serial link protocol. The IMS C004, 
a member of this family, is a transparent programmable link switch designed to provide a full crossbar switch 
between 32 link inputs and 32 link outputs. 

The IMS C004 will switch links running at either the standard speed of 10 Mbits/sec or at the higher speed 
of 20 Mbits/sec. It Introduces, on average, only a 1 .75 bit time delay on the signal. Link switches can be 
cascaded to any depth without loss of signal integrity and can be used to construct reconfigurable networks 
of arbitrary size. The switch is programmed via a separate serial link called the cx)nfiguration link. 

All INMOS products which use communication links, regardless of device type, support a standard commu- 
nications frequency of 10 Mbits/sec; most products also support 20 Mbits/sec. Products of different type 
or performance can, therefore, be interconnected directly and future systems will be able to communicate 
directly with those of today. 
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Figure 1.1 IMS C004 block diagram 
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Pin designations 



Table 2.1 IMS C004 system services 



Pin 


In/Out 


Function 


VCC, GND 




Power supply and return 


CapPlus, CapMinus 




External capacitor for internal clock power supply 


Clockln 


in 


Input clock 


Reset 


in 


System reset 


DoNotWire 




Must not be wired 



Table 2.2 IMS C004 configuration 



Pin 


In/Out 


Function 


ConfigLinkln 
ConfigLinkOut 


in 
out 


INMOS configuration link input 
INMOS configuration link output 



Table 2.3 IMS C004 link 



Pin 


In/Out 


Function 


LinklnO-31 

LinkOutO-31 

LinkSpeed 


in 

out 
in 


INMOS link inputs to the switch 
INMOS link outputs from the switch 
Link speed selection 



Signal names are prefixed by not if they are active low, otherwise they are active high. 
Pinout details for various packages are given on page 498. 
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3 System services 

System services Include all the necessary logic to start up and maintain the IMS C004. 



3.1 



Power 



Power is supplied to the device via the VCC and GND pins. Several of each are provided to minimise 
inductance within the package. All supply pins must be connected. The supply must be decoupled close to 
the chip by at least one 100 nF low inductance (e.g. ceramic) capacitor between VCC and GND. Four layer 
boards are recommended; if two layer boards are used, extra care should be taken in decoupling. 

Input voltages must not exceed specification with respect to VCC and GND, even during power-up and power- 
down ramping, othenA^ise latchup can occur. CMOS devices can be permanently damaged by excessive 
periods of latchup. 



3.2 



CapPlus, CapMinus 



The internally derived power supply for internal clocks requires an external low leakage, low inductance 1/iF 
capacitor to be connected between CapPlus and CapMinus. A ceramic capacitor is preferred, with an 
impedance less than 3 Ohms between 100 KHz and 10 MHz. If a polarised capacitor is used the negative 
terminal should be connected to CapMinus. Total PCB track length should be less than 50 mm. The 
connections must not touch power supplies or other noise sources. 
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Figure 3.1 Recommended PLL decoupling 



3.3 



Clockln 



Transputer family components use a standard clock frequency, supplied by the user on the Clockln input. 
The nominal frequency of this clock for all transputer family components is 5 MHz, regardless of device type, 
transputer word length or processor cycle time. High frequency internal clocks are derived from Clockln, 
simplifying system design and avoiding problems of distributing high speed clocks externally. 

A number of transputer family devices may be connected to a common clock, or may have individual clocks 
providing each one meets the specified stability criteria. In a multi-clock system the relative phasing of 
Clockln clocks is not important, due to the asynchronous nature of the links. Mark/space ratio is unimportant 
provided the specified limits of Clockln pulse widths are met. 

Oscillator stability is Important. Clockln must be derived from a crystal oscillator; RC oscillators are not 
sufficiently stable. Clockln must not be distributed through a long chain of buffers. Clock edges must be 
monotonic and remain within the specified voltage and time limits. 



3 System services 
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Table 3.1 Input clock 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TDCLDCH 


Clockln pulse width low 


40 






ns 


1 


TDCHDCL 


Clockln pulse width high 


40 






ns 


1 


TDCLDCL 


Clockln period 




200 




ns 


1,2,4 


TDCerror 


Clockln timing error 






±0.5 


ns 


1,3 


TDC1 DC2 


Difference in Clockln for 2 linked devices 






400 


ppm 


1.4 


TDCr 


Clockln rise time 






10 


ns 


1.5 


TDCf 


Clockln fall time 






8 


ns 


1.5 



Notes 



1 These paramters are not tested. 

2 Measured between corresponding points on consecutive falling edges. 

3 Variation of individual falling edges from their nominal times. 

4 This value allows the use of 200ppm crystal oscillators for two devices connected together by a link. 

5 Clock transitions must be monotonic within the range VIH to VIL (table 7.3). 
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Figure 3.2 Clockln timing 
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12 [MS C004 engineering data 



3.4 



Reset 



The Reset pin can go high with VCC, but must at no time exceed the maximum specified voltage for VIH. 
After VCC is valid Clockln should be running for a minimum period TDCVRL before the end of Reset. 

Reset initialises the IMS C004 to a state where all link outputs from the switch are disconnected and held 
low; the control link is then ready to receive a configuration message. 



Table 3.2 Reset 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPVRH 
TRHRL 
TDCVRL 


Powe'- valid before Reset 

Reset pulse width high 

Clockln running before Reset end 


10 
8 
10 






ms 

Clockln 

ms 


1 
2 



Notes 



1 Full periods of Clockln TDCLDCL required. 

2 At power-on reset. 
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Figure 3.3 Reset Timing 
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4 Links 

INMOS bi-directional serial links provide synchronized communication between INMOS products and with 
the outside world. Each link comprises an input channel and output channel. A link between two devices is 
implemented by connecting a link interface on one device to a link interface on the other device. Every byte 
of data sent on a link is acknowledged on the input of the same link, thus each signal line carries both data 
and control information. 

A receiver can transmit an acknowledge as soon as it starts to receive a data byte. In this way the transmission 
of an acknowledge can be overlapped with receipt of a data byte to provide continuous transmission of data. 
This technique is fully compatible with all other INMOS transputer family links. 

The quiescent state of a link output is low. Each data byte is transmitted as a high start bit followed by a one 
bit followed by eight data bits followed by a low stop bit. The least significant bit of data is transmitted first. 
After transmitting a data byte the sender waits for the acknowledge, which consists of a high start bit followed 
by a zero bit. The acknowledge signifies that the receiving link is able to receive another byte. 

Links are not synchronised with Clockln and are insensitive to its phase. Thus links from independently 
clocked systems may communicate, providing only that the clocks are nominally identical and within specifi- 
cation. 

Links are TTL compatible and intended to be used in electrically quiet environments, between devices on a 
single printed circuit board or between two boards via a backplane. Direct connection may be made between 
devices separated by a distance of less than 300 millimetres. For longer distances a matched 100 ohm 
transmission line should be used with series matching resistors RM. When this is done the line delay should 
be less than 0.4 bit time to ensure that the reflection returns before the next data bit is sent. 

Buffers may be used for very long transmissions. If so, their overall propagation delay should be stable within 
the skew tolerance of the link, although the absolute value of the delay is immaterial. 

The IMS C004 links support the standard INMOS communication speed of 10 Mbits/sec. In addition they 
can be used at 20 Mbits/sec. When the LinkSpeed pin is low, all links operate at the standard 10 Mbits/sec; 
when high they operate at 20 Mbits/sec. 

A single IMS C004 inserted between two transputers which fully implement overlapped acknowledges will 
cause some reduction in data bandwidth, see table 4.2 and figure 4.7. 
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Figure 4.1 IMS C004 link data and acknowledge packets 
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Table 4.1 Link 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TJQr 


LinkOut rise time 






20 


ns 


1 


TJQf 


LinkOut fall time 






10 


ns 


1 


TJDr 


Linkin rise time 






20 


ns 


1 


TJDf 


Linkin fall time 






20 


ns 


1 


TJQJD 


Buffered edge delay 









ns 




TJBskew 


Variation in TJQJD 20 Mbits/s 






3 


ns 


2 




10Mbits/s 






10 


ns 


2 


CLIZ 


Linkin capacitance @ f=1 MHz 






7 


PF 


1 


CLL 


LinkOut load capacitance 






50 


PF 




RM 


Series resistor for 1 0On transmission line 




56 




ohms 





Notes 



1 These paramters are sampled, but are not 100% tested. 

2 This is the variation in the total delay through buffers, transmission lines, differential receivers etc., caused by 
such things as short term variation in supply voltages and differences in delays for rising and falling edges. 




Figure 4.2 IMS C004 link timing 
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Figure 4.3 IMS C004 buffered link timing 



4 Links 
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Figure 4.4 IMS C004 Links directly connected 
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Figure 4.5 IMS C004 Links connected by transmission line 
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Figure 4.6 IMS C004 Links connected by buffers 
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Table 4.2 T800 links data transfer rate at 20 Mbits/sec 





Without C004 


Witil C004 


Degradation 


Unidirectional 
Bidirectional 


1.7 
2.3 


1.3 
2.1 


25% 
10% 
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Figure 4.7 IMS C004 link timing 



Notes 



1 All values are in ns. 

2 Timing values shown are for links at 20 Mbits/sec. 
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5 Switch implementation 

The IMS C004 is internally organised as a set of thirtytwo 32-to-1 multiplexors. Each multiplexor has asso- 
ciated with it a six bit latch, five bits of which select one Input as the source of data for the corresponding 
output. The sixth bit is used to connect and disconnect the output. These latches can be read and written 
by messages sent on the configuration link via ConfigLinkIn and ConfigLinkOut. 

The output of each multiplexor is synchronised with an internal high speed clock and regenerated at the 
output pad. This synchronisation introduces, on average, a 1 .75 bit time delay on the signal. As the signal is 
not electrically degraded in passing through the switch, it is possible to form links through an arbitrary number 
of link switches. 

Each input and output is identified by a number in the range to 31. A configuration message consisting 
of one, two or three bytes Is transmitted on the configuration link. The configuration messages sent to the 
switch on this link are shown in table 5.1. If an unspecified configuration message is used, the effect of it is 
undefined. 



Table 5.1 IMS C004 configuration messages 



Configuration Message 


Function 


[0] [input] [output] 
[1] [Iink1] [Iink2] 

[2] [output] 
[3] 

[4] 

[5] [output] 

[6] [Iink1] [tink2] 


Connects input to output. 

Connects Iink1 to iink2 by connecting the input of Iink1 to the output of 
Iink2 and the input of Iink2 to the output of linkl. 

Enquires which input the output is connected to. The IMS C004 responds 
with the input. The most signifigant bit of this byte indicates whether the 
output is connected (bit set high) or disconnected (bit set low). 

This command byte must be sent at the end of every configuration 
sequence which sets up a connection. The IMS C004 is then ready to 
accept data on the connected inputs. 

Resets the switch. All outputs are disconnected and held low. This also 
happens when Reset is applied to the IMS C004. 

Output output is disconnected and held low. 

Disconnects the output of linkl and the output of Iink2. 
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6 Applications 

6.1 Link switching 

The IMS C004 provides full switching capabilities between 32 INMOS links. It can also be used as a compo- 
nent of a larger link switch. For example, three IMS C004's can be connected together to produce a 48 way 
switch, as shown in figure 6.1. This technique can be extended to the switch shown in figure 6.2. 

A fully connected network of 32 INMOS transputers (one in which all four links are used on every transputer) 
can be completely configured using just four IMS C004's. Figure 6.5 shows the connected transputer network. 

In these diagrams each link line shown represents a unidirectional link; i.e. one output to one input. Where 
a number is also given, that denotes the number of lines. 

6.2 IVIuItiple IMS C004 control 

Many systems require a number of IMS C004's, each configured via its own configuration link. A simple 
method of implementing this uses a master IMS C004, as shown in figure 6.3. One of the transputer links is 
used to configure the master link switch, whilst another transputer link is multiplexed via the master to send 
configuration messages to any of the other 31 IMS C004 links. 

6.3 Bidirectional exchange 

Use of the IMS C004 is not restricted to computer configuration applications. The ability to change the switch 
setting dynamically enables it to be used as a general purpose message router. This may, of course, also 
find applications in computing with the emergence of the new generation of supercomputers, but a more 
widespread use may be found as a communication exchange. 

In the application shown in figure 6.4, a message into the exchange must be preceded by a destination 
token dest When this message is passed, the destination token is replaced with a source token so that 
the receiver knows where the message has come from. The In.out device in the diagram and the controller 
can be implemented easily with a transputer, and the link protocol for establishing communication with these 
devices can be interfaced with INMOS link adaptors. All messages from rx[l] are preceded by the destination 
output dest. On receipt of such a message the in.out device requests the controller to connect a bidirectional 
link path to dest. The controller determines what is currently connected to each end of the proposed link. 
When both ends are free it sets up the IMS C004 and informs both ends of the new link. Note that In this 
network two channels are placed on each IMS C004 link, one for each direction. 

6.4 Bus systems 

The IMS C004 can be used in conjunction with the INMOS IMS C01 1/C012 link adaptors to provide a flexible 
means of connecting conventional bus based microprocessor systems. 
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Figure 6.1 48 way link switch 




Figure 6.2 Generalised link switch 
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Figure 6.3 Multiple IMS C004 cx)ntroller 
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Figure 6.4 32 way bidirectional exchange 
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Figure 6.5 Complete cx)nnectivity of a transputer network using four IMS C004's 
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7 Electrical specifications 

7.1 DC electrical characteristics 



Table 7.1 Absolute maximum ratings 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


vcc 


DC supply voltage 





7.0 


V 


1,2,3 


VI, vo 


Voltage on input and output pins 


-0.5 


VCC+0.5 


V 


1,2,3 


II 


Input current 




±25 


mA 


4 


OSCT 


Output short circuit time (one pin) 




1 


s 


2 


TS 


Storage temperature 


-65 


150 


°C 


2 


TA 


Ambient temperature under bias 


-55 


125 


°C 


2 


PDmax 


Maximum allowable dissipation 




2 


W 





Notes 



1 All voltages are with respect to GND. 

2 This is a stress rating only and functional operation of the device at these or any other conditions beyond those 
indicated in the operating sections of this specification is not implied. Stresses greater than those listed may 
cause permanent damage to the device. Exposure to absolute maximum rating conditions for extended periods 
may affect reliability. 

3 This device contains circuitry to protect the inputs against damage caused by high static voltages or electrical 
fields. However, it is advised that normal precautions be taken to avoid application of any voltage higher than the 
absolute maximum rated voltages to this high impedance circuit. Unused inputs should be tied to an appropriate 
logic level such as VCC or GND. 

4 The input current applies to any input or output pin and applies when the voltage on the pin is between GND 
and vcc. 



Table 7.2 Operating conditions 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VCC 


DC supply voltage 


4.75 


5.25 


V 


1 


VI, vo 


Input or output voltage 





VCC 


V 


1,2 


CL 


Load capacitance on any pin 




60 


PF 




TA 


Operating temperature range IMS C004-S 





70 


°C 


3 


TA 


Operating temperature range IMS C004-M 


-55 


125 


°C 


3 



Notes 



1 All voltages are with respect to GND. 

2 Excursions beyond the supplies are permitted but not recommended; see DC characteristics. 

3 Air flow rate 400 linear ft/min transverse air flow. 
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Table 7.3 DC characteristics 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VIH 


High level input voltage 


2.0 


VCC+0.5 


V 


1,2 


VIL 


Low level input voltage' 


-0.5 


0.8 


V 


1.2 


II 


Input current @ GND<VI<VCC 




±10 


/iA 


1.2 


VOH 


Output high voltage @ IOH=2mA 


VCC-1 




V 


1,2 


VOL 


Output low voltage @ IOL=4mA 




0.4 


V 


1,2 


lOS 


Output short circuit current @ GND<VO<VCC 


36 


65 


mA 


1.2,3,6 






65 


100 


mA 


1,2,4,6 


PD 


Power dissipation 




1.5 


W 


2.5 


CIN 


Input capacitance @ f=1 MHz 




7 


PF 


6 


COZ 


Output capacitance @ f=1 MHz 




10 


PF 


6 



Notes 



1 Ail voltages are with respect to GND. 

2 Parameters for IMS C004-S measured at 4.75V<VCC<5.25V and 0°C<TA<70°C. 
Input clock frequency = 5 MHz. 

3 Current sourced from non-link outputs. 

4 Current sourced from link outputs. 

5 Power dissipation varies with output loading and with the number of links active. 

6 This parameter is sampled and not 100% tested. 



7.2 



Equivalent circuits 



Output 



VCC 




Load for: 


R1 


R2 


Equivalent load: 


Link outputs 
Other outputs 


1K96 
970R 


47K 
24K 


1 SchottkyTTL input 

2 Schottky TTL inputs 



Diodes are 1N916 



Figure 7.1 Load circuit for AC measurements 
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Figure 7.2 AC measurements timing waveforms 
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Figure 7.3 Typical link rise/fall times 



7 Electrical specifications 



497 



7.4 Power rating 

Internal power dissipation Pjnt of transputer and peripheral chips depends on VCC, as shown in figure 7.4. 
Pint is substantially Independent of temperature. 

Total power dissipation Pd of the chip is 

Pd = Pint + -Pro 
where Pjo is the power dissipation in the input and output pins; this is application dependent. 
Internal working temperature Tj of the chip is 

Tj:=:Ta+OJa*Pd 

where Ta is the external ambient temperature in °C and $Ja is the junction-to-ambient thermal resistance in 
°C/W. OJa for each package is given in the Packaging Specifications section. 
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Figure 7.4 IMS C004 internal power dissipation vs VCC 
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8 Package specifications 

8.1 84 pin grid array package 



10 




Link 
In16 



Figure 8.1 IMS C004 84 pin grid array package pinout 
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Figure 8.2 84 pin grid array package dimensions 



Table 8.1 84 pin grid array package dimensions 



DIIVI 


Millimetres 


Inches 


Notes 


NOM 


TOL 


NOM 


TOL 


A 


26.924 


±0.254 


1.060 


±0.010 




B 


17.019 


±0.127 


0.670 


±0.005 




C 


2.456 


±0.278 


0.097 


±0.011 




D 


4.572 


±0.127 


0.180 


±0.005 




E 


3.302 


±0.127 


0.130 


±0.005 




F 


0.457 


±0.025 


0.018 


±0.002 


Pin diameter 


G 


1.143 


±0.127 


0.045 


±0.005 


Flange diameter 


K 


22.860 


±0.127 


0.900 


±0.005 




L 


2.540 


±0.127 


0.100 


±0.005 




M 


0.508 




0.020 




Chamfer 



Package weigiit is approximately 7.2 grams 

Table 8.2 84 pin grid array package junction to ambient thermal resistance 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


^JA 


At 400 linear ft/min transverse air flow 






35 


°c/w 





500 
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8.2 84 lead quad cerpack package 

The leads are unformed to allow the user to form them to specific requirements. 



LInkOut? 75 

LinkOute 76 

LInkOutS 77 

LinkOut4 78 

VCC79 

GND80 

LinkOuta 81 

LinkOut2 82 

LInkOutI 83 

LinkOutO 84 

LinkOutai 1 

LinkOutSO 2 

LinkOut29 3 

LinkOut28 4 

LinkOut27 5 

VCC 6 

GND 7 

LinkOut26 8 

LinkOut25 9 
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ConfigLinkOut 11 
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d53 LinkOutS 
d52 LlnkOut9 
D51 LinkOunO 
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=149 VCC 
D 48 GND 
D47LinkOut12 
D46LlnkOut13 
=145 LInkOutI 4 
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Figure 8.3 IMS C004 84 lead quad cerpack package pinout 
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Figure 8.4 84 lead quad cerpack package dimensions 



DIIVI 


Millimetres 


Inches 


Notes 


NOIVI 


TOL 


NOM 


TOL 


A 


38.100 


±0.508 


1.500 


±0.020 




B 


26.924 


d=0.305 


1.060 


±0.012 




C 


20.574 


±0.203 


0.810 


±0.008 




D 


19.558 


±0.254 


0.770 


±0.010 




E 


0.508 




0.020 






F 


1.270 


±0.051 


0.050 


±0.002 




G 


2.489 


±0.305 


0.098 


±0.012 




H 


0.635 


±0.076 


0.025 


±0.003 




J 


1.143 


±0.102 


0.045 


±0.004 




K 


3.099 




0.122 




Max. 


L 


27.940 




1.100 




Max. 


M 


0.178 


±0.025 


0.007 


±0.001 





Table 8.3 84 lead quad cerpack package dimensions 
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9 Ordering 

This section indicates the designation of package selections for the IMS C004. Speed of Clockln is 5 MHz 
for all parts. 

For availability contact local INMOS sales office or authorised distributor. 

Table 9.1 IMS C004 ordering details 



INMOS designation 


Package 


IMS C004-G20S 

IMS C004-G20M 
IMS C004-Q20M 


Ceramic Pin Grid Array 

Ceramic Pin Grid Array MIL Spec 
Quad Cerpack MIL Spec 
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Engineering Data 



FEATURES 

standard INMOS link protocol 
10 or 20 Mbits/sec operating speed 
Communicates with INMOS transputers 
Converts between serial link and parallel bus 
Converts between serial link and parallel device 

Two modes of parallel operation: 
Mode 1: Peripheral interface 

Eight bit parallel input interface 
Eight bit parallel output interface 
Full handshake on input and output 

Mode 2: Bus interface 

Tristate bidirectional bus interface 
Memory mapped registers 
Interrupt capability 

Single +5V ±5% power supply 
TTL and CMOS compatibility 
120mW power dissipation 
Standard 28 pin 0.6" plastic package 
MIL-STD-883C device is available 

APPLICATIONS 

Programmable I/O pins for transputer 
Connecting microprocessors to transputers 
High speed links between microprocessors 
Inter-family microprocessor interfacing 
Interconnecting different speed links 
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1 Introduction 

The INMOS communication link is a high speed system interconnect which provides full duplex communication 
between members of the INMOS transputer family, according to the INMOS serial link protocol. The IMS C01 1 , 
a member of this family, provides for full duplex transputer link communication with standard microprocessor 
and sub-system architectures, by converting bi-directional serial link data into parallel data streams. The 
extended temperature version of the device complies with MIL-STD-883C. 

All INMOS products which use communication links, regardless of device type, support a standard commu- 
nications frequency of 10 Mbits/sec; most products also support 20 Mbits/sec. Products of different type 
or performance can, therefore, be interconnected directly and future systems will be able to communicate 
directly with those of today. The IMS C01 1 link will run at either the standard speed of 1 Mbits/sec or at the 
higher speed of 20 Mbits/sec. Data reception is asynchronous, allowing communication to be independent 
of clock phase. 

The link adaptor can be operated in one of two modes. In Mode 1 the IMS C011 converts between a link 
and two independent fully handshaken byte-wide interfaces, one input and one output. It can be used by a 
peripheral device to communicate with a transputer, an INMOS peripheral processor or another link adaptor, 
or it can provide programmable input and output pins for a transputer. Two IMS C01 1 devices in this mode 
can be connected back to back via the parallel ports and used as a frequency changer between different 
speed links. 

In Mode 2 the IMS C011 provides an interface between an INMOS serial link and a microprocessor system 
bus. Status and data registers for both input and output ports can be accessed across the byte-wide bi- 
directional interface. Two interrupt outputs are provided, one to indicate input data available and one for 
output buffer empty. 
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Figure 1.1 IMS C011 Mode 1 block diagram 
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Figure 1.2 IMS C011 Mode 2 block diagram 
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Pin designations 



Table 2.1 IMS C011 services and link 



Pin 


In/Out 


Function 


VCC, GND 
CapMinus 
Clockln 
Reset 

SeparatelQ 

Linkin 

LinkOut 


in 
in 

in 
in 
out 


Power supply and return 

External capacitor for internal clock power supply 

Input clock 

System reset 

Select mode and Mode 1 link speed 
Serial data input channel 
Serial data output channel 



Table 2.2 IMS C011 Mode 1 parallel interface 



Pin 


In/Out 


Function 


10-7 

IValid 

lAck 

QO-7 

QValid 

QAck 


in 
in 
out 

out 
out 
in 


Parallel input bus 

Data on 10-7 is valid 

Acknowledge 10-7 data received by other link 

Parallel output bus 

Data on QO-7 is valid 

Acknowledge from device: data QO-7 was read 



Table 2.3 IMS C011 Mode 2 parallel interface 



Pin 


In/Out 


Function 


DO-7 
notes 
RSO-1 
RnotW 


in/out 
in 
in 
in 


Bi-directional data bus 
Chip select 
Register select 
Read/write control signal 


Inputint 

Outputint 

LinkSpeed 


out 
out 
in 


Interrupt on link receive buffer full 
Interrupt on link transmit buffer empty 
Select link speed as 10 or 20 Mbits/sec 


HoldToGND 
DoNotWire 




Must be connected to GND 
Must not be wired 



Signal names are prefixed by not if they are active low, otherwise they are active high. 
Pinout details for various packages are given on page 524. 
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3 System services 

System services include all the necessary logic to start up and maintain the IMS C01 1 . 

3.1 Power 

Power is supplied to the device via the VCC and GND pins. The supply must be decoupled close to the chip 
by at least one 100 nF low inductance (e.g. ceramic) capacitor between VCC and GND. Four layer boards 
are recommended; if two layer boards are used, extra care should be taken in decoupling. 

AC noise between VCC and GND must be kept below 200 mV peak to peak at all frequencies above 1 00 KHz. 
AC noise between VCC and the ground reference of load capacitances must be kept below 200 mV peak to 
peak at all frequencies above 30 MHz. Input voltages must not exceed specification with respect to VCC and 
GND, even during power-up and power-down ramping, otherwise latchup can occur. CMOS devices can be 
permanently damaged by excessive periods of latchup. 

3.2 CapMinus 

The internally derived power supply for internal clocks requires an external low leakage, low inductance VF 
capacitor to be connected between VCC and CapMinus. A ceramic capacitor is preferred, with an impedance 
less than 3 Ohms between 1 00 KHz and 1 MHz. If a polarised capacitor is used the negative terminal should 
be connected to CapMinus. Total PCB track length should be less than 50 mm. The positive connection of 
the capacitor must be connected directly to VCC. Connections must not otherwise touch power supplies or 
other noise sources. 
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Figure 3.1 Recommended PLL decoupling 



3.3 Clockln 



Transputer family components use a standard clock frequency, supplied by the user on the Clockln input. 
The nominal frequency of this clock for all transputer family components is 5 MHz, regardless of device type, 
transputer word length or processor cycle time. High frequency internal clocks are derived from Clockln, 
simplifying system design and avoiding problems of distributing high speed clocks externally. 

A number of transputer family devices may be connected to a common clock, or may have individual clocks 
providing each one meets the specified stability criteria. In a multi-clock system the relative phasing of 
Clockln clocks is not important, due to the asynchronous nature of the links. Mark/space ratio is unimportant 
provided the specified limits of Clockln pulse widths are met. 

Oscillator stability is important. Clockln must be derived from a crystal oscillator; RC oscillators are not 
sufficiently stable. Clockln must not be distributed through a long chain of buffers. Clock edges must be 
monotonic and remain within the specified voltage and time limits. 
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Table 3.1 Input clock 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TDCLDCH 


Clockln pulse width low 


40 






ns 


1 


TDCHDCL 


Clockln pulse width high 


40 






ns 


1 


TDCLDCL 


Clockln period 




200 


400 


ns 


1,2.4 


TDCerror 


Clockln tinning error 






±0.5 


ns 


1.3 


TDC1 DC2 


Difference In Clockln for 2 linked devices 






400 


ppm 


1,4 


TDCr 


Clockln rise time 






10 


ns 


1.5 


TDCf 


Clockln fall time 






8 


ns 


1,5 



Notes 



1 These paramters are not tested. 

2 Measured between corresponding points on consecutive falling edges. 

3 Variation of Individual falling edges from their nominal times. 

4 This value allows the use of 200ppm crystal oscillators for two devices connected together by a link. 

5 Clock transitions must be monotonic within the range VIH to VIL (table 7.3). 
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Figure 3.2 Clockln timing 



3.4 



SeparatelQ 



The IMS coil link adaptor has two different modes of operation. Mode 1 is basically a link to peripheral 
adaptor, whilst Mode 2 interfaces between a link and a microprocessor bus system. 

Mode 1 can be selected for one of two link speeds by connecting SeparatelQ to VCC (10 Mbits/sec) or to 
Clockln (20 Mbits/sec). 

Mode 2 is selected by connecting SeparatelQ to GND; in this mode 10 Mbits/sec or 20 Mbits/sec is selected 
by LinkSpeed. Link speeds are specified for a Clockln frequency of 5 MHz. 

In order to select the link speed, SeparatelQ may be changed dynamically providing the link Is in a quiescent 
state and no input or output is required. Reset must be applied subsequent to the selection to initialise 
the device. If Clockln is gated to achieve this, its skew must be limited to the value TDCHSIQH shown in 
table 3.3. The mode of operation (Mode 1 , Mode 2) must not be changed dynamically. 
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Table 3.2 SeparatelQ mode selection 



SeparatelQ 


Mode 


Link Speed Mbits/sec 


VCC 

Clockln 

GND 


1 
1 
2 


10 

20 

10 or 20 



Table 3.3 SeparatelQ 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TDCHSIQH 


Skew from Clockln to Clockln 






20 


ns 


1 



Notes 



3.5 



1 Skew between Clockln arriving on the Clockln pin and on the SeparatelQ pin. 



Reset 



The Reset pin can go high with VCC, but must at no time exceed the maximum specified voltage for VIH. 
After VCC is valid Clockln should be running for a minimum period TDCVRL before the end of Reset. LinkIn 
must be held low during Reset. 

Reset initialises the IMS C011 to the following state: LlnkOut is held low; the control outputs (lAck and 
QValld in Mode 1, Inputint and Outputint in Mode 2) are held low; interrupts (Mode 2) are disabled; the 
states of QO-7 in Mode 1 are unspecified; DO-7 in Mode 2 are high impedance. 



Table 3.4 Reset 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TPVRH 
TRHRL 
TDCVRL 


Power valid before Reset 

Reset pulse width high 

Clockln running before Reset end 


10 
8 

10 






ms 

Clockln 

ms 


1 
2 



Notes 



1 Full periods of Clockln TDCLDCL required. 

2 At power-on reset. 
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Figure 3.3 Reset Timing 
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4 Links 

INMOS bi-directional serial links provide synchronized communication between INMOS products and with 
the outside world. Each link comprises an input channel and output channel. A link between two devices is 
implemented by connecting a link interface on one device to a link interface on the other device. Every byte 
of data sent on a link is acknowledged on the input of the same link, thus each signal line carries both data 
and control information. 

The quiescent state of a link output is low. Each data byte is transmitted as a high start bit followed by a 
one bit followed by eight data bits followed by a low stop bit. The least significant bit of data is transmitted 
first. After transmitting a data byte the sender waits for the acknowledge, which consists of a high start bit 
followed by a zero bit. The acknowledge signifies both that a process was able to receive the acknowledged 
data byte and that the receiving link is able to receive another byte. 

Links are not synchronised with Clockln and are insensitive to its phase. Thus links from independently 
clocked systems may communicate, providing only that the clocks are nominally identical and within specifi- 
cation. 

Links are TTL compatible and intended to be used in electrically quiet environments, between devices on a 
single printed circuit board or between two boards via a backplane. Direct connection may be made between 
devices separated by a distance of less than 300 millimetres. For longer distances a matched 100 ohm 
transmission line should be used with series matching resistors RM. When this is done the line delay should 
be less than 0.4 bit time to ensure that the reflection returns before the next data bit is sent. 

Buffers may be used for very long transmissions. If so, their overall propagation delay should be stable within 
the skew tolerance of the link, although the absolute value of the delay is immaterial. 

The IMS C01 1 link supports the standard INMOS communication speed of 10 Mbits/sec. In addition it can be 
used at 20 Mbits/sec. Link speed can be selected in one of two ways. In Mode 1 it is altered by SeparatelQ 
(page 507). In Mode 2 it is selected by LinkSpeed; when the LinkSpeed pin is low, the link operates at the 
standard 10 Mbits/sec; when high it operates at 20 Mbits/sec. 





H L, 
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H'H 01 234567L, 
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Figure 4.1 IMS C011 link data and acknowledge packets 



510 



13 IMS C011 engineering data 



Table 4.1 Link 



SYMBOL 


PARAMETER 


MiN 


NOM 


MAX 


UNITS 


NOTE 


TJQr 


LinkOut rise time 






20 


ns 


1 


TJQf 


LinkOut fall time 






10 


ns 


1 


TJDr 


Linkin rise time 






20 


ns 


1 


TJDf 


Linkin fall time 






20 


ns 


1 


TJQJD 


Buffered edge delay 









ns 




TJBskew 


Variation in TJQJD 20 Mbits/s 






3 


ns 


2 




lOMbits/s 






10 


ns 


2 


CLIZ 


Linkin capacitance @ f=1 MHz 






7 


PF 


1 


CLL 


LinkOut load capacitance 






50 


PF 




RM 


Series resistor for lOOn transmission line 




56 




ohms 





Notes 



1 These paramters are sampled, but are not 1 00% tested. 

2 This is the variation in the total delay through buffers, transmission lines, differential receivers etc., caused by 
such things as short term variation in supply voltages and differences in delays for rising and falling edges. 




Figure 4.2 IMS C01 1 link timing 
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Figure 4.3 IMS C01 1 buffered link timing 



4 Links 
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Figure 4.4 IMS C01 1 Links directly connected 
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Figure 4.5 IMS C011 Links connected by transmission line 
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Figure 4.6 IMS C01 1 Links connected by buffers 
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5 Mode 1 parallel interface 

In Mode 1 the IMS C011 link adaptor is configured as a parallel peripheral interface with handshake lines. 
Communication with a transputer family device is via the serial link. The parallel interface comprises an input 
port and an output port, both with handshake. 



5.1 



Input port 



The eight bit parallel input port 10-7 can be read by a transputer family device via the serial link. IValid and 
lAck provide a simple two-wire handshake for this port. When data is valid on 10-7, IValid is taken high by 
the peripheral device to commence the handshake. The link adaptor transmits data presented on 10-7 out 
through the serial link. When the acknowledge packet is received on the input link, the IMS C01 1 sets lAck 
high. To complete the handshake, the peripheral device must return IValid low. The link adaptor will then 
set lAck low. New data should not be put onto 10-7 until lAck is returned low. 

Table 5.1 Mode 1 parallel data input 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TIdVlvH 

TlvHLdV 

TLaVlaH 

TIaHldX 

TIaHlvL 

TlvLlaL 

TIaLlvH 


Data setup 

IValid high to link data output 

Link acknowledge start to lAck high 

Data hold after lAck high 

IValid hold after lAck high 

lAck hold after IValid low 

Delay before next IValid high 


5 
0.8 




1 





2 
3 

4 


ns 
bits 
bits 
ns 
ns 
bits 
ns 


1,2 
1 

1 



Notes 



1 Unit of measurement is one link data bit time; at 10 Mbits/s data link speed, one bit time Is nominally 100 ns. 

2 Maximum time assumes there is no acknowledge packet already on the link. Maximum time with acknowledge 
on the link is extended by 2 bits. 
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Figure 5.1 IMS C011 Mode 1 parallel data input to link adaptor 



5 Mode 1 parallel interface 
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5.2 



Output port 



The eight bit parallel output port QO-7 can be controlled by a transputer family device via the serial link. 
QValid and QAck provide a simple two-wire handshake for this port. 

A data packet received on the input link is presented on QO-7; the link adaptor then takes QValid high to 
initiate the handshake. After reading data from QO-7, the peripheral device sets QAck high. The IMS C011 
will then send an acknowledge packet out of the serial link to indicate a completed transaction and set QValid 
low to complete the handshake. 



Table 5.2 Mode 1 parallel data output 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TLdVQvH 
TQdVQvH 
TQvHQaH 
TQaHQvL 
TQaHLaV 
TQvLQaL 
TQvLQdX 


Start of link data to QValid 
Data setup 

QAck setup time from QValid high 
QAck high to QValid low 
QAck high to Ack on link 
QAck hold after QValid low 
Data hold 


11.5 
15 


1.8 

0.8 



11 




2 


bits 
ns 
ns 
bits 
bits 
ns 
bits 


1 
2 

1 
1,3 

1,4 



Notes 



1 Unit of measurement is one link data bit time; at 10 Mbits/s data link speed, one bit time is nominally 100 ns. 

2 Where an existing data output bit is re-written with the same level there will be no glitch in the output level. 

3 Maximum time assumes there is no data packet already on the link. Maximum time with data on the link is 
extended by 1 1 bits. 

4 Data output remains valid until oven/vritten by new data. 
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Figure 5.2 IMS C01 1 Mode 1 parallel data output from link adaptor 
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6 Mode 2 Parallel interface 

The IMS C011 provides an interface between a link and a microprocessor style bus. Operation of the link 
adaptor is controlled through the parallel interface bus lines DO-7 by reading and writing various registers in 
the link adaptor. Registers are selected by RSO-1 and RnotW, and the chip enabled with notCS. 

For convenience of description, the device connected to the parallel side of the link adaptor is presumed to 
be a microprocessor, although this will not always be the case. 



6.1 



DO-7 



Data is communicated between a microprocessor bus and the link adaptor via the bidirectional bus lines DO-7. 
The bus is high impedance unless the link adaptor chip is selected and the RnotW line is high. The bus is 
used by the microprocessor to access status and data registers. 



6.2 



notes 



The link adaptor chip is selected when notCS is low. Register selectors RSO-1 and RnotW must be valid 
before notCS goes low; DO-7 must also be valid if writing to the chip (RnotW low). Data is read by the link 
adaptor on the rising edge of notCS. 



6.3 



RnotW 



RnotW, in conjunction with notCS, selects the link adaptor registers for read or write mode. When RnotW 
is high, the contents of an addressed register appear on the data bus DO-7; when RnotW is low the data 
on DO-7 is written into the addressed register. The state of RnotW is latched into the link adaptor by notCS 
going low; it may be changed before notCS returns high, within the timing restrictions given. 



6.4 



RSO-1 



One of four registers is selected by RSO-1. A register Is addressed by setting up RSO-1 and then taking 
notes low; the state of RnotW when notCS goes low determines whether the register will be read or written. 
The state of RSO-1 is latched into the link adaptor by notCS going low; it may be changed before notCS 
returns high, within the timing restrictions given. The register set comprises a read-only data input register, 
a write-only data output register and a read/write status register for each. 

Table 6.1 IMS CG1 1 Mode 2 register selection 



RSI 


RSO 


RnotW 


Register 








1 


Read data 











Invalid 





1 


1 


Invalid 





1 





Write data 


1 





1 


Read input status 


1 








Write input status 


1 


1 


1 


Read output status 


1 


1 





Write output status 



6.4.1 Input Data Register 

This register holds the last data packet received from the serial link. It never contains acknowledge packets. 
It contains valid data only whilst the data present flag is set in the input status register. It cannot be assumed 
to contain valid data after it has been read; a double read may or may not return valid data on the second 
read. If data present is valid on a subsequent read it indicates new data is in the buffer. Writing to this register 
will have no effect. 



Mode 2 Parallel interface 
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Table 6.2 IMS C01 1 Mode 2 parallel interface control 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TRSVCSL 


Register select setup 


5 






ns 




TCSLRSX 


Register select hold 


5 






ns 




TRWVCSL 


Read/write strobe setup 


5 






ns 




TCSLRWX 


Read/write strobe hold 


5 






ns 




TCSLCSH 


Chip select active 


50 






ns 




TCSHCSL 


Delay before re-assertion of chip select 


50 






ns 





Table 6.3 IMS COHMode 2 parallel interface read 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TLdVIIH 
TCSLIIL 
TCSLDrX 


Start of link data to InputInt high 
Chip select to InputInt low 
Chip select to bus active 


5 




13 
30 


bits 
ns 
ns 


1 


TCSLDrV 


Chip select to data valid 






40 


ns 




TCSHDrZ 
TCSHDrX 
TCSHLaV 


Chip select high to bus tristate 
Data hold after chip select high 
Chip de-select to start of Ack 


5 
0.8 




25 
2 


ns 
ns 
bits 


1.2 



Notes 



1 Unit of measurement is one link data bit time; at 10 Mbits/s data link speed, one bit time Is nominally 100 ns. 

2 Maximum time assumes there is no data packet already on the link. Maximum time with data on the link is 
extended by 1 1 bits. 
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Figure 6.1 IMS C011 Mode 2 read parallel data from link adaptor 
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Table 6.4 IMS C011 Mode 2 parallel interface write 



SYiyiBOL 


PARAIVIETER 


MIN 


NOIVI 


MAX 


UNITS 


NOTE 


TCSHDwV 


Data setup 


15 






ns 




TCSHDwX 


Data hold 


5 






ns 




TCSLOIL 


Chip select to Outputint low 






30 


ns 




TCSHLdV 


Chip select high to start of link data 


0.8 




2 


bits 


1,2 


TLaVOIH 


Start of link Ack to Outputint high 






3 


bits 


1.3 


TLdVOIH 


Start of link data to Outputint high 






13 


bits 


1,3 



Notes 



1 Unit of measurement is one link data bit time; at 10 Mbits/s data link speed, one bit time is nominally 100 ns. 

2 Maximum time assumes there is no acknowledge packet already on the link. Maximum time with acknowledge 
on the link is extended by 2 bits. 

3 Both data transmission and the returned acknowledge must be completed before Outputint can go high. 
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Figure 6.2 IMS C01 1 Mode 2 write parallel data to link adaptor 
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6.4.2 Input Status Register 

This register contains the data present flag and the interrupt enable control bit for Inputlnt. The data present 
flag is set to Indicate that data in the data Input buffer is valid. It Is reset low only when the data Input buffer 
Is read, or by Reset. When writing to this register, the data present bit must be written as zero. 

The interrupt enable bit can be set and reset by writing to the status register with this bit high or low re- 
spectively. When the interrupt enable and data present ilags are both high, the Inputlnt output will be high 
(page 517). Resetting interrupt enable will take Inputlnt low; setting It again before reading the data input 
register will set Inputlnt high again. The interrupt enable bit can be read to deternriine its status. 

When writing to this register, bits 2-7 must be written as zero; this ensures that they will be zero when the 
register Is read. Failure to write zeroes to these bits may result in undefined data being returned by these 
bits during a status register read. 
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Figure 6.3 IMS C01 1 input status register 

6.4.3 Output Data Register 

Data written to this link adaptor register is transmitted out of the serial link as a data packet. Data should 
only be written to this register when the output ready bit in the output status register Is high, otherwise data 
already being transmitted may be corrupted. Reading this register will result in undefined data being read. 

6.4.4 Output Status Register 

This register contains the output ready flag and the interrupt enable control bit for Outputlnt. The output 
ready flag Is set to indicate that the data output buffer Is empty. It Is reset low only when data Is written to the 
data output buffer; it is set high by Reset. When writing to this register, the output ready bit must be written 
as zero. 

The interrupt enable bit can be set and reset by writing to the status register with this bit high or low respec- 
tively. When the interrupt enable and output ready flags are both high, the Outputlnt output will be high 
(page 518). Resetting interrupt enable will take Outputlnt low; setting it again whilst the data output register 
is empty will set Outputlnt high again. The interrupt enable bit can be read to determine it's status. 

When writing to this register, bits 2-7 must be written as zero; this ensures that they will be zero when the 
register Is read. Failure to write zeroes to these bits may result In undefined data being returned by these 
bits during a status register read. 



6.5 



Inputlnt 



The Inputlnt output Is set high to indicate that a data packet has been received from the serial link. It Is 
inhibited from going high when the interrupt enable bit In the input status register Is low (page 517). Inputlnt 
Is reset low when data Is read from the Input data register (page 514) and by Reset (page 508). 
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Figure 6.4 IMS C01 1 output status register 



6.6 



Outputlnt 



The Outputlnt output is set high to Indicate that the link is free to receive data from the microprocessor for 
transmission as a data packet out of the serial link. It is inhibited from going high when the interrupt enable 
bit in the output status register is low (page 517). Outputlnt is reset low when data is written to the data 
output register (page 517); it is set low by Reset (page 508). 



6.7 



Data read 



A data packet received on the input link sets the data present Hag in the input status register. If the interrupt 
enable bit In the status register is set, the Inputint output pin will be set high. The microprocessor will either 
respond to the interrupt (if the interrupt enable bit is set) or will periodically read the input status register until 
the data present bit is high. 

When data is available from the link, the microprocessor reads the data packet from the data input register. 
This will reset the data present 1\aQ and cause the link adaptor to transmit an acknowledge packet out of the 
serial link output. Inputint is automatically reset by reading the data input register; it is not necessary to read 
or write the Input status register. 



6.8 



Data write 



When the data output buffer is empty the output ready flag In the output status register is set high. If 
the interrupt enable bit in the status register is set, the Outputlnt output pin will also be set high. The 
microprocessor will either respond to the interrupt (if the interrupt enable bit is set) or will periodically read 
the output status register until the output ready bit is high. 

When the output ready flag Is high, the microprocessor can write data to the data output buffer. This will 
result in the link adaptor resetting the output ready Hag and commencing transmission of the data packet out 
of the serial link. The output ready status bit will remain low until an acknowledge packet is received by the 
input link. This will set the output reac/y flag high; if the interrupt enable bit is set, Outputlnt will also be set 
high. 
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7 Electrical specifications 

7.1 DC electrical characteristics 



Table 7.1 Absolute maximum ratings 



SYMBOL 


PARAIVIETER , 


IVIIN 


MAX 


UNITS 


NOTE 


VCC 


DC supply voltage 





7.0 


V 


1,2,3 


VI, vo 


Voltage on input and output pins 


-0.5 


VCC+0.5 


V 


1.2,3 


II 


Input current 




±25 


mA 


4 


OSCT 


Output short circuit time (one pin) 




1 


s 


2 


TS 


Storage temperature 


-65 


150 


°C 


2 


TA 


Ambient temperature under bias 


-55 


125 


°C 


2 


PDmax 


Maximum allowable dissipation 




600 


mW 





Notes 



1 All voltages are with respect to GND. 

2 This is a stress rating only and functional operation of the device at these or any other conditions beyond those 
indicated in the operating sections of this specification is not implied. Stresses greater than those listed may 
cause permanent damage to the device. Exposure to absolute maximum rating conditions for extended periods 
may affect reliability. 

3 This device contains circuitry to protect the inputs against damage caused by high static voltages or electrical 
fields. However, it is advised that normal precautions be taken to avoid application of any voltage higher than the 
absolute maximum rated voltages to this high impedance circuit. Unused inputs should be tied to an appropriate 
logic level such as VCC or GND. 

4 The input current applies to any input or output pin and applies when the voltage on the pin is between GND 
and vcc. 



Table 7.2 Operating conditions 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VCC 

VI, vo 

CL 
TA 


DC supply voltage 
Input or output voltage 
Load capacitance on any pin 
Operating temperature range 


4.75 





5.25 
VCC 

60 

70 


V 
V 
PF 
°C 


1 
1.2 

3 



Notes 



1 All voltages are with respect to GND. 

2 Excursions beyond the supplies are permitted but not recommended; see DC characteristics. 

3 Air flow rate 400 linear ft/min transverse air flow. 
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Table 7.3 DC characteristics 



SYIVIBOL 


PARAIVIETER 


MIN 


MAX 


UNITS 


NOTE 


VIH 


High level input voltage 




2.0 


VCC+0.5 


V 


1,2 


VIL 


Low level input voltage 




-0.5 


0.8 


V 


1,2 


II 


Input current 


@ GND<VI<VCC 




±10 
±200 


mA 


1,2,7 
1.2.8 


VOH 


Output high voltage 


@ IOH=2mA 


VCC-1 




V 


1,2 


VOL 


Output low voltage 


@ IOL=4mA 




0.4 


V 


1,2 


lOS 


Output short circuit current @ GND<VO<VCC 


36 


65 


mA 


1 .2.3.6 








65 


100 


mA 


1,2,4,6 


lOZ 


Tristate output current 


@ GND<VO<VCC 




±10 


/xA 


1,2 


PD 


Power dissipation 






120 


mW 


2,5 


CIN 


Input capacitance 


@f=:1MHZ 




7 


PF 


6 


COZ 


Output capacitance 


@f=1MHz 




10 


PF 


6 



Notes 



1 All voltages are with respect to GND. 

2 Parameters for IMS C011-S measured at 4.75V<VCC<5.25V and 0°C<TA<70°C. 
Input clock frequency = 5 MHz. 

3 Current sourced from non-link outputs. 

4 Current sourced from link outputs. 

5 Power dissipation varies with output loading. 

6 This parameter is sampled and not 100% tested. 

7 For inputs other than those in Note 8. 

8 For pins 2. 3. 5. 6, 7. 9, 11. 13. 15. 16, 25. 



7.2 Equivalent circuits 



VCC 



R1 



Output 



50pF 
GND - 



R2 



Load for: 


R1 


R2 


Equivalent load: 


Link outputs 
Other outputs 


1K96 
970R 


47K 
24K 


1 Schottky TTL input 

2 Schottky TTL inputs 



Diodes are 1 N91 6 



Figure 7.1 Load circuit for AC measurements 



7 Electrical specifications 



521 




Figure 7.2 AC measurements timing waveforms 



Test point 
Output under test 

GND 



50pF4= 



510R ^ 



vcc 



Figure 7.3 Tristate load circuit for AC measurements 
7.3 AC timing characteristics 

Table 7.4 Input, output edges 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


TDr 


Input rising edges 


2 


20 


ns 


1.2 


TDf 


Input falling edges 


2 


20 


ns 


1.2 


TQr 


Output rising edges 




25 


ns 


1 


TQf 


Output falling edges 




15 


ns 


1 


CSLaHZ 


Chip select high to tristate 




25 


ns 




CSLaLZ 


Chip select low to tristate 




25 


ns 





Notes 



1 Non-link pins; see section on links. 

2 All inputs except Ciockin; see section on Ciockln. 
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Figure 7.4 IMS C01 1 input and output edge timing 
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Figure 7.5 IMS C011 tristate timing relative to notCS 
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Notes 



Figure 7.6 Typical rise/fall times 



1 Skew is measured between notCS with a standard load (2 Schottky TTL inputs and SOpF) and 
notes with a load of 2 Schottky TTL inputs and varying capacitance. 
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7.4 Power rating 

Internal power dissipation Pjnt of transputer and peripheral chips depends on VCC, as shown in figure 7.7. 
Pint 's substantially independent of temperature. 

Total power dissipation Pd of the chip is 

Pd = Pint + Pio 
where Pjo is the power dissipation in the input and output pins; this is application dependent. 
Internal working temperature Tj of the chip is 

Tj^Ta+9Ja*Pd 

where Ta is the external ambient temperature in °C and OJa is the junction-to-ambient thermal resistance in 
°C/W. OJa for each package is given in the Packaging Specifications section. 



200 - 

Power 150 - 
PINT 

^^ 100- 
50 - 


^^ 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 

4.4 4.6 4.8 5.0 5.2 5.4 5.6 
VCC Volts 



Figure 7.7 IMS C011 internal power dissipation vs VCC 



524 



8 Package specifications 

8.1 28 pin plastic dual-in-line package 





-^ A ^ 


1 1 
B 
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Figure 8.1 28 pin plastic dual-in-line package dimensions 



Table 8.1 28 pin plastic dual-in-line package dimensions 



DIM 


Millimetres 


Inches 


Notes 


NOM 


TOL 


NOM 


TOL 


A 


36.830 


±0.254 


1.450 


±0.010 




B 


13.970 


±0.254 


0.550 


±0.010 




C 


4.445 


±0.635 


0.175 


±0.025 




D 


15.240 


±0.076 


0.600 


±0.003 




E 


1.905 




0.075 






F 


2.540 




0.100 






G 


1.397 


±0.254 


0.055 


±0.010 




H 


0.457 




0.018 






J 


16.256 


±0.508 


0.640 


±0.020 




L 


0.508 




0.020 




Minimum 


M 


3.429 




0.135 




Maximum 



Package weight is approximately 4 grams 

Table 8.2 28 pin plastic dual-in-line package junction to ambient thermal resistance 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


^JA 


At 400 linear ft/mm transverse air flow 


110 




°c/w 
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8.2 28 pin ceramic dual-in-line package 
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Figure 8.2 28 pin ceramic dual-in-line package dimensions 



DIM 


Millimetres 


Inches 


Notes 


NOM 


TOL 


NOM 


TOL 


A 


35.560 


±0.356 


1.400 


±0.014 




B 


15.494 


±0.254 


0.610 


±0.010 




C 


14.681 


+0.813 


0.578 


+0.032 




D 


2.466 


±0.229 


0.097 


±0.009 




E 


1.270 


±0.254 


0.051 


±0.010 




F 


3.048 




0.120 




Minimum 


G 


2.540 




0.100 






H 


0.457 


±0.051 


0.018 


±0.002 




J 


1.016 


+0.508 


0.040 


+0.020 





Table 8.3 28 pin ceramic dual-in-line package dimensions 
Package weight is approximately 5 grams 



Table 8.4 28 pin ceramic dual-in-line package junction to ambient thermal resistance 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


^JA 


At 400 linear ft/min transverse air flow 




60 




°G/W 
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8.3 28 pin SOIC package 

New product - for availability contact INMOS. 




Figure 8.3 28 pin SOIC package dimensions 



"Table 8.5 28 pin SOIC package dimensions 



DIM 


Millimetres 


Inches 


Notes 


MIN 


MAX 


MIN 


MAX 


A 


17.526 


18.491 


0.697 


0.728 


1 


B 


1 1 .506 


12.700 


0.453 


0.500 




C 


8.230 


8.890 


0.324 


0.350 


1 


D 


2.337 


2.692 


0.092 


0.106 




E 


0.356 


0.508 


0.014 


0.020 




F 


0.356 


0.610 


0.014 


0.024 


3 


G 


1.270 




0.050 




basic 


H 




3.048 




0.120 




J 


0.051 


0.356 


0.002 


0.014 




K 


0.152 


0.317 


0.006 


0.012 




L 


0.406 


1.270 


0.016 


0.050 




a 


0° 


8° 









Notes 



1 Overall length and width dimensions do not include mold flash or protrusions. Mold flash or protrusions shall 
not exceed 0.006 inches per side. 

2 Formed leads shall be planar with respect to one another within 0.004 inches at seating plane. 

3 F is to allow for positive dambar protrusion. 
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8.4 



Pinout 









VCC 


LinkOut 1 [ 


yj 


] 28 


Linkin 2 t 


3 27 


CapMinus 


IValid 3 [ 




] 26 


QValid 


lAck 4 [ 




] 25 


QAck 


10 5 [ 




] 24 


QO 


11 6 [ 




] 23 


Q1 


12 7 c 




] 22 


Q2 


13 8 [ 




3 21 


Q3 


14 9 [ 




] 20 


Q4 


15 10 t 




3 19 


Q5 


16 11 c 




] 18 


Q6 


17 12 I 




3 17 


Q7 


Reset 13 c 




3 16 


SeparatelQ 


GND 14 [ 




3 15 


Clockln 









Figure 8.4 IMS C011 Mode 1 pinout 









VCC 


LinkOut 1 C 


w 


3 28 


Linkin 2 C 


3 27 


CapMinus 


RnotW 3 I 




3 26 


Inputint 


Outputlnt 4 [ 




3 25 


notes 


RSO 5 C 




] 24 


DO 


RSI 6 [ 




3 23 


D1 


DoNotWire 7 c 




3 22 


D2 


D3 8 C 




3 21 


DoNotWire 


DoNotWire 9 C 




3 20 


D4 


D5 10 E 




3 19 


DoNotWire 


HoldToGND 11 c 




3 18 


D6 


D7 12 I 




3 17 


LinkSpeed 


Reset 13 c 




3 16 


SeparatelQ 


GND 14 C 




3 15 


Clockln 









Figure 8.5 IMS C011 Mode 2 pinout 
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9 Ordering 

This section indicates the designation of package selections for the IMS C011. Speed of Clockln is 5 MHz 
for all parts. 

For availability contact local INMOS sales office or authorised distributor. 

Table 9.1 IMS C011 ordering details 



INMOS designation 


Package 


IMSC011-P20S 
IMSC011-S20S 

IIVIS C011-S20IVI 


28 pin plastic dual-in-line 
28 pin ceramic sidebraze 

28 pin ceramic sidebraze MIL Spec 
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IMS C012 

ink adaptor 



Qonmos 



Engineering Data 



FEATURES 

Standard INMOS link protocol 

10 or 20 Mbits/sec operating speed 

Communicates with INMOS transputers 

Converts between serial link and parallel bus 

Tristate bidirectional bus interface 

Memory mapped registers 

Interrupt capability 

Single +5V ±5% power supply 

TTL and CMOS compatibility 

120mW power dissipation 

Standard 24 pin 0.3" plastic package 

APPLICATIONS 

Connecting microprocessors to transputers 
High speed links between microprocessors 
Inter-family microprocessor interfacing 
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Introduction 



The INMOS communication link is a high speed system interconnect which provides full duplex communication 
between members of the INMOS transputer family, according to the INMOS serial link protocol. The IMS C01 2, 
a member of this family, provides for full duplex transputer link communication with standard microprocessor 
and sub-system architectures, by converting bi-directional serial link data into parallel data streams. 

All INMOS products which use communication links, regardless of device type, support a standard commu- 
nications frequency of 1 Mbits/sec; most products also support 20 Mbits/sec. Products of different type 
or performance can, therefore, be interconnected directly and future systems will be able to communicate 
directly with those of today. The IMS C012 link will run at either the standard speed of 1 Mbits/sec or at the 
higher speed of 20 Mbits/sec. Data reception is asynchronous, allowing communication to be independent 
of clock phase. 

The IMS C012 provides an interface between an INMOS serial link and a microprocessor system bus. Status 
and data registers for both input and output ports can be accessed across the byte-wide bi-directional interface. 
Two interrupt outputs are provided, one to indicate input data available and one for output buffer empty. 
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Figure 1.1 IMS C012 block diagram 
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Pin designations 



Table 2.1 IMS C012 services and link 



Pin 


In/Out 


Function 


VCC, GND 
CapMinus 
Clockln 
Reset 

Linkin 
LinkOut 


in 
in 

in 
out 


Power supply and return 

External capacitor for internal clock power supply 

Input clock 

System reset 

Serial data input channel 
Serial data output channel 



Table 2.2 IMS C012 parallel interface 



Pin 


In/Out 


Function 


DO-7 


in/out 


Bi-directional data bus 


notes 


in 


Chip select 


RSO-1 


in 


Register select 


RnotW 


in 


Read/write control signal 


Inputint 


out 


Interrupt on link receive buffer full 


Outputint 


out 


Interrupt on link transmit buffer empty 


LinkSpeed 


in 


Select link speed as 10 or 20 Mbits/sec 


HoldToGND 




Must be connected to GND 



Signal names are prefixed by not if they are active low, otherwise they are active high. 
Pinout details for various packages are given on page 548. 
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3 System services 

System services include all the necessary logic to start up and maintain the IMS C012. 

3.1 Power 

Power is supplied to the device via the VCC and GND pins. The supply must be decoupled close to the chip 
by at least one 100 nF low inductance (e.g. ceramic) capacitor between VCC and GND. Four layer boards 
are recommended; if two layer boards are used, extra care should be taken in decoupling. 

AC noise between VCC and GND must be kept below 200 mV peak to peak at all frequencies above 100 KHz. 
AC noise between VCC and the ground reference of load capacitances must be kept below 200 mV peak to 
peak at all frequencies above 30 MHz. Input voltages must not exceed specification with respect to VCC and 
GND, even during power-up and power-down ramping, otherwise latchup can occur. CMOS devices can be 
permanently damaged by excessive periods of latchup. 

3.2 CapMinus 

The internally derived power supply for internal clocks requires an external low leakage, low inductance VF 
capacitor to be connected between VCC and CapMinus. A ceramic capacitor is preferred, with an impedance 
less than 3 Ohms between 1 00 KHz and 1 MHz. If a polarised capacitor is used the negative terminal should 
be connected to CapMinus. Total PCB track length should be less than 50 mm. The positive connection of 
the capacitor must be connected directly to VCC. Connections must not othenA/ise touch power supplies or 
other noise sources. 
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1 
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GND -I- 


CapMinus 
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Figure 3.1 Recommended PLL decoupling 



3.3 Clockln 



Transputer family components use a standard clock frequency, supplied by the user on the Clockln input. 
The nominal frequency of this clock for all transputer family components is 5 MHz, regardless of device type, 
transputer word length or processor cycle time. High frequency internal clocks are derived from Clockln, 
simplifying system design and avoiding problems of distributing high speed clocks externally. 

A number of transputer family devices may be connected to a common clock, or may have individual clocks 
providing each one meets the specified stability criteria. In a multi-clock system the relative phasing of 
Clockln clocks is not important, due to the asynchronous nature of the links. Mark/space ratio is unimportant 
provided the specified limits of Clockln pulse widths are met. 

Oscillator stability is important. Clockln must be derived from a crystal oscillator; RC oscillators are not 
sufficiently stable. Clockln must not be distributed through a long chain of buffers. Clock edges must be 
monotonic and remain within the specified voltage and time limits. 



3 System services 
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Table 3.1 Input clock 



SYMBOL 


PARAMETER 


MiN 


NOM 


MAX 


UNITS 


NOTE 


TDCLDCH 


Clockln pulse width low 


40 






ns 


1 


TDCHDCL 


Clockln pulse width high 


40 






ns 


1 


TDCLDCL 


Clockln period 




200 


400 


ns 


1,2,4 


TDCerror 


Clockln timing error 






±0.5 


ns 


1.3 


TDC1 DC2 


Difference in Clockln for 2 linked devices 






400 


ppm 


1,4 


TDCr 


Clockln rise tinne 






10 


ns 


1,5 


TDCf 


Clockln fall time 






8 


ns 


1,5 



Notes 



1 These paramters are not tested. 

2 Measured between corresponding points on consecutive falling edges. 

3 Variation of individual falling edges from their nominal times. 

4 This value allows the use of 200ppm crystal oscillators for two devices connected together by a link. 

5 Clock transitions must be monotonic within the range VIH to VIL (table 6.3). 



TDCerror 



1.5v- 



2.0v ^ 



^ TDCerror 



TDCerror ^ 



TDCLDCH 




TDCHDCL 
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Figure 3.2 Clockln timing 
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3.4 



Reset 



The Reset pin can go high with VCC, but must at no time exceed the maximum specified voltage for VIH. 
After VCC is valid Clockln should be running for a minimum period TDCVRL before the end of Reset. Linkin 
must be held low during Reset. 

Reset initialises the IMS C012 to the following state: LinkOut is held low; the interrupt outputs Inputint and 
Outputint are held low; interrupts are disabled; DO-7 are high impedance. 

Table 3.2 Reset 



SYMBOL 


PARAMETER 


MiN 


NOM 


MAX 


UNITS 


NOTE 


TPVRH 
TRHRL 
TDCVRL 


Power valid before Reset 

Reset pulse width high 

Clockln running before Reset end 


10 
8 
10 






ms 

Clockln 

ms 


1 
2 



Notes 



1 Full periods of Clockln TDCLDCL required. 

2 At power-on reset. 



Clockln 



VCC 



Reset 




V7\ \////////////////A [//////// 



TPVRH ^ 



TDCVRL 



TRHRL 




Figure 3.3 Reset Timing 
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4 Links 

INMOS bi-directional serial links provide synchronized communication between INMOS products and with 
the outside world. Each link comprises an input channel and output channel. A link between two devices is 
implemented by connecting a link interface on one device to a link interface on the other device. Every byte 
of data sent on a link is acknowledged on the input of the same link, thus each signal line carries both data 
and control information. 

The quiescent state of a link output is low. Each data byte is transmitted as a high start bit followed by a 
one bit followed by eight data bits followed by a low stop bit. The least significant bit of data is transmitted 
first. After transmitting a data byte the sender waits for the acknowledge, which consists of a high start bit 
followed by a zero bit. The acknowledge signifies both that a process was able to receive the acknowledged 
data byte and that the receiving link is able to receive another byte. 

Links are not synchronised with Clockln and are insensitive to its phase. Thus links from independently 
clocked systems may communicate, providing only that the clocks are nominally identical and within specifi- 
cation. 

Links are TTL compatible and intended to be used in electrically quiet environments, between devices on a 
single printed circuit board or between two boards via a backplane. Direct connection may be made between 
devices separated by a distance of less than 300 millimetres. For longer distances a matched 100 ohm 
transmission line should be used with series matching resistors RM. When this is done the line delay should 
be less than 0.4 bit time to ensure that the reflection returns before the next data bit is sent. 

Buffers may be used for very long transmissions. If so, their overall propagation delay should be stable within 
the skew tolerance of the link, although the absolute value of the delay is immaterial. 

The IMS C012 link supports the standard INMOS communication speed of 10 Mbits/sec. In addition it can 
be used at 20 Mbits/sec. Link speed is selected by LinkSpeed; when the LinkSpeed pin is low, the link 
operates at the standard 1 Mbits/sec; when high it operates at 20 Mbits/sec. 





H L, 
1 Ack 1 


|h'H 01234567 L, 


1 Data 1 



Figure 4.1 IMS C012 link data and acknowledge packets 
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Table 4.1 Link 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TJQr 


LinkOut rise time 






20 


ns 


1 


TJQf 


LinkOut fall time 






10 


ns 


1 


TJDr 


Linkin rise time 






20 


ns 


1 


TJDf 


Linkin fall time 






20 


ns 


1 


TJQJD 


Buffered edge delay 









ns 




TJBskew 


Variation in TJQJD 20 Mbits/s 






3 


ns 


2 




10Mbits/s 






10 


ns 


2 


CLIZ 


Linkin capacitance @ f=1 MHz 






7 


PF 


1 


CLL 


LinkOut load capacitance 






50 


PF 




RM 


Series resistor for 100n transmission line 




56 




ohms 





Notes 



1 These paramters are sampled, but are not 1 00% tested. 

2 This is the variation in the total delay through buffers, transmission lines, differential receivers etc., caused by 
such things as short term variation in supply voltages and differences in delays for rising and falling edges. 




Figure 4.2 IMS C012 link timing 
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Figure 4.3 IMS C012 buffered link timing 
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Figure 4.4 IMS C012 Links directly connected 
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Figure 4.5 IMS C012 Links connected by transmission line 
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Figure 4.6 IMS C012 Links connected by buffers 
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5 Parallel Interface 

The IMS C012 provides an interface between a link and a microprocessor style bus. Operation of the link 
adaptor is controlled through the parallel interface bus lines DO-7 by reading and writing various registers in 
the link adaptor. Registers are selected by RSO-1 and RnotW, and the chip enabled with notCS. 

For convenience of description, the device connected to the parallel side of the link adaptor is presumed to 
be a microprocessor, although this will not always be the case. 



5.1 



DO-7 



Data is communicated between a microprocessor bus and the link adaptor via the bidirectional bus lines DO-7. 
The bus is high impedance unless the link adaptor chip is selected and the RnotW line is high. The bus is 
used by the microprocessor to access status and data registers. 



5.2 



notes 



The link adaptor chip is selected when notCS is low. Register selectors RSO-1 and RnotW must be valid 
before notCS goes low; DO-7 must also be valid if writing to the chip (RnotW low). Data is read by the link 
adaptor on the rising edge of notCS. 



5.3 



RnotW 



RnotW, in conjunction with notCS, selects the link adaptor registers for read or write mode. When RnotW 
is high, the contents of an addressed register appear on the data bus DO-7; when RnotW is low the data 
on DO-7 is written into the addressed register. The state of RnotW is latched into the link adaptor by notCS 
going low; it may be changed before notCS returns high, within the timing restrictions given. 



5.4 



RSO-1 



One of four registers is selected by RSO-1. A register Is addressed by setting up RSO-1 and then taking 
notes low; the state of RnotW when notCS goes low determines whether the register will be read or written. 
The state of RSO-1 is latched into the link adaptor by notCS going low; it may be changed before notCS 
returns high, withiri the timing restrictions given. The register set comprises a read-only data input register, 
a write-only data output register and a read/write status register for each. 

Table 5.1 IMS C012 register selection 



RSI 


RSO 


RnotW 


Register 








1 


Read data 











Invalid 





1 


1 


Invalid 





1 





Write data 


1 





1 


Read input status 


1 








Write input status 


1 


1 


1 


Read output status 


1 


1 





Write output status 



5.4.1 Input Data Register 

This register holds the last data packet received from the serial link. It never contains acknowledge packets. 
It contains valid data only whilst the data present Hag is set in the input status register. It cannot be assumed 
to contain valid data after it has been read; a double read may or may not return valid data on the second 
read. If data present is valid on a subsequent read it indicates new data is in the buffer. Writing to this register 
will have no effect. 



5 Parallel interface 
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Table 5.2 IMS C012 parallel Interface control 








SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TRSVCSL 


Register select setup 


5 






ns 




TCSLRSX 


Register select hold 


5 






ns 




TRWVCSL 


Read/write strobe setup 


5 






ns 




TCSLRWX 


Read/write strobe hold 


5 






ns 




TCSLCSH 


Chip select active 


50 






ns 




TCSHCSL 


Delay before re-assertion of chip select 


50 






ns 





Table 5.3 IMS C012 parallel interface read 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TLdVIIH 


Start of link data to Inputint high 






13 


bits 


1 


TCSLIIL 


Chip select to Inputint low 






30 


ns 




TCSLDrX 


Chip select to bus active 


5 






ns 




TCSLDrV 


Chip select to data valid 






40 


ns 




TCSHDrZ 


Chip select high to bus tristate 






25 


ns 




TCSHDrX 


Data hold after chip select high 


5 






ns 




TCSHLaV 


Chip de-select to start of Ack 


0.8 




2 


bits 


1,2 



Notes 



1 Unit of measurement is one link data bit time; at 10 Mbits/s data link speed, one bit time is nominally 100 ns. 

2 Maximum time assumes there is no data packet already on the link. Maximum time with data on the link is 
extended by 1 1 bits. 
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Figure 5.1 IMS C012 read parallel data fronn link adaptor 
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Table 5.4 IMS C012 parallel interface write 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


TCSHDwV 

TCSHDwX 

TCSLOIL 

TCSHLcJV 

TLaVOIH 

TLdVOIH 


Data setup 

Data hold 

Chip select to Outputint low 

Chip select high to start of link data 

Start of link Ack to Outputint high 

Start of link data to Outputint high 


15 
5 

0.8 




30 
2 
3 

13 


ns 
ns 
ns 
bits 
bits 
bits 


1,2 
1,3 
1,3 



Notes 



1 Unit of measurement is one link data bit time; at 10 Mbits/s data link speed, one bit time is nominally 100 ns. 

2 Maximum time assumes there is no acknowledge packet already on the link. Maximum time with acknowledge 
on the link Is extended by 2 bits. 

3 Both data transmission and the returned acknowledge must be completed before Outputint can go high. 
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Figure 5.2 IMS C012 write parallel data to link adaptor 
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5.4.2 Input Status Register 

This register contains the data present flag and the interrupt enable control bit for Inputlnt. The data present 
flag is set to indicate that data in the data input buffer is valid. It is reset low only when the data input buffer 
is read, or by Reset. When writing to this register, the data present bit must be written as zero. 

The interrupt enable bit can be set and reset by writing to the status register with this bit high or low re- 
spectively. When the interrupt enable and data present Hags are both high, the Inputlnt output will be high 
(page 541). Resetting interrupt enable will take Inputlnt low; setting it again before reading the data input 
register will set Inputlnt high again. The interrupt enable bit can be read to determine its status. 

When writing to this register, bits 2-7 must be written as zero; this ensures that they will be zero when the 
register is read. Failure to write zeroes to these bits may result in undefined data being returned by these 
bits during a status register read. 
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Figure 5.3 IMS C012 input status register 

5.4.3 Output Data Register 

Data written to this link adaptor register is transmitted out of the serial link as a data packet. Data should 
only be written to this register when the output ready bit in the output status register is high, otherwise data 
already being transmitted may be corrupted. Reading this register will result in undefined data being read. 

5.4.4 Output Status Register 

This register contains the output ready flag and the interrupt enable control bit for Outputlnt. The output 
ready flag is set to indicate that the data output buffer is empty. It is reset low only when data is written to the 
data output buffer; it is set high by Reset. When writing to this register, the output ready bit must be written 
as zero. 

The interrupt enable bit can be set and reset by writing to the status register with this bit high or low respec- 
tively. When the interrupt enable and output ready flags are both high, the Outputlnt output will be high 
(page 542). Resetting interrupt enable will take Outputlnt low; setting it again whilst the data output register 
is empty will set Outputlnt high again. The interrupt enable bit can be read to determine it's status. 

When writing to this register, bits 2-7 must be written as zero; this ensures that they will be zero when the 
register is read. Failure to write zeroes to these bits may result in undefined data being returned by these 
bits during a status register read. 



5.5 



Inputlnt 



The Inputlnt output is set high to indicate that a data packet has been received from the serial link. It is 
inhibited from going high when the interrupt enable bit in the input status register is low (page 541). Inputlnt 
is reset low when data is read from the input data register (page 538) and by Reset (page 534). 
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Figure 5.4 IMS C012 output status register 



5.6 



Outputint 



The Outputint output is set high to indicate that the link is free to receive data from the microprocessor for 
transmission as a data packet out of the serial link. It is inhibited from going high when the interrupt enable 
bit in the output status register is low (page 541). Outputint is reset low when data is written to the data 
output register (page 541); it is set low by Reset (page 534). 



5.7 



Data read 



A data packet received on the input link sets the data present flag in the input status register. If the interrupt 
enable bit in the status register is set, the Inputint output pin will be set high. The microprocessor will either 
respond to the interrupt (if the interrupt enable bit is set) or will periodically read the input status register until 
the data present b\t is high. 

When data is available from the link, the microprocessor reads the data packet from the data input register. 
This will reset the data present Uag and cause the link adaptor to transmit an acknowledge packet out of the 
serial link output. Inputint is automatically reset by reading the data input register; it is not necessary to read 
or write the Input status register. 



5.8 



Data write 



When the data output buffer is empty the output ready flag in the output status register is set high. If 
the interrupt enable bit in the status register is set, the Outputint output pin will also be set high. The 
microprocessor will either respond to the interrupt (if the interrupt enable bit is set) or will periodically read 
the output status register until the output ready bit is high. 

When the output ready flag is high, the microprocessor can write data to the data output buffer. This will 
result in the link adaptor resetting the output ready ^lag and commencing transmission of the data packet out 
of the serial link. The output ready status bit will remain low until an acknowledge packet is received by the 
input link. This will set the output ready flag high; if the interrupt enable bit is set, Outputint will also be set 
high. 
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6 Electrical specifications 

6.1 DC electrical characteristics 



Table 6.1 Absolute maximum ratings 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VCC 


DC supply voltage 





7.0 


V 


1.2,3 


VI, vo 


Voltage on input and output pins 


-0.5 


VCC+0.5 


V 


1,2,3 


II 


Input current 




±25 


mA 


4 


OSCT 


Output short circuit time (one pin) 




1 


s 


2 


TS 


Storage temperature 


-65 


150 


°C 


2 


TA 


Ambient temperature under bias 


-55 


125 


°C 


2 


PDmax 


Maximum allowable dissipation 




600 


mW 





Notes 



1 All voltages are with respect to GND. 

2 This is a stress rating only and functional operation of the device at these or any other conditions beyond those 
indicated in the operating sections of this specification is not implied. Stresses greater than those listed may 
cause permanent damage to the device. Exposure to absolute maximum rating conditions for extended periods 
may affect reliability. 

3 This device contains circuitry to protect the inputs against damage caused by high static voltages or electrical 
fields. However, it is advised that normal precautions be taken to avoid application of any voltage higher than the 
absolute maximum rated voltages to this high impedance circuit. Unused inputs should be tied to an appropriate 
logic level such as VCC or GND. 

4 The input current applies to any input or output pin and applies when the voltage on the pin is between GND 
and vcc. 



Table 6.2 Operating conditions 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VCC 

VI, vo 

CL 
TA 


DC supply voltage 
Input or output voltage 
Load capacitance on any pin 
Operating temperature range 


4.75 





5.25 

VCC 

60 

70 


V 
V 
PF 
°C 


1 
1.2 

3 



Notes 



1 All voltages are with respect to GND. 

2 Excursions beyond the supplies are permitted but not recommended; see DC characteristics. 

3 Air flow rate 400 linear ft/min transverse air flow. 
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Table 6.3 DC characteristics 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


VIH 


High level Input voltage 


2.0 


VCC+0.5 


V 


1,2 


VIL 


Low level input voltage 


-0.5 


0.8 


V 


1,2 


II 


Input current @ GND<VI<VCC 




±10 


/iA 


1.2,7 








±200 


/iA 


1,2.8 


VOH 


Output high voltage @ IOH=2mA 


VCC-1 




V 


1,2 


VOL 


Output low voltage @ IOL=4mA 




0.4 


V 


1,2 


lOS 


Output short circuit current @ GND<VO<VCC 


36 


65 


mA 


1,2,3,6 






65 


100 


mA 


1,2,4,6 


lOZ 


Tristate output current @ GND<VO<VCC 




±10 


mA 


1,2 


PD 


Power dissipation 




120 


mW 


2,5 


CIN 


Input capacitance @ f=1 MHz 




7 


PF 


6 


COZ 


Output capacitance @ f=1 MHz 




10 


PF 


6 



Notes 



1 All voltages are with respect to GND. 

2 Parameters for IMS C012-S measured at 4.75V<VCC<5.25V and 0°C<TA<70X. 
Input clock frequency = 5 MHz. 

3 Current sourced from non-link outputs. 

4 Current sourced from link outputs. 

5 Power dissipation varies with output loading. 

6 This parameter is sampled and not 100% tested. 

7 For inputs other than those in Note 8. 

8 For pins 2, 3, 5, 6, 7, 9. 11, 13, 14, 21. 



6.2 Equivalent circuits 



VCC 



Output 




Load for: 


R1 


R2 


Equivalent load: 


Link outputs 
Other outputs 


1K96 
970R 


47K 
24K 


1 Schottky TTL input 

2 Schottky TTL inputs 



Diodes are 1N916 



Figure 6.1 Load circuit for AC measurements 
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Figure 6.2 AC measurements timing waveforms 
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6.3 



Figure 6.3 Tristate load circuit for AC measurements 
AC timing characteristics 

Table 6.4 Input, output edges 



SYMBOL 


PARAMETER 


MIN 


MAX 


UNITS 


NOTE 


TDr 


Input rising edges 


2 


20 


ns 


1,2 


TDf 


Input falling edges 


2 


20 


ns 


1.2 


TQr 


Output rising edges 




25 


ns 


1 


TQf 


Output falling edges 




15 


ns 


1 


CSLaHZ 


Chip select high to tristate 




25 


ns 




CSLaLZ 


Chip select low to tristate 




25 


ns 





Notes 



1 Non-link pins; see section on links. 

2 All inputs except Clockln; see section on Clockln. 
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Figure 6.4 IMS C012 input and output edge timing 
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Figure 6.5 IMS C012 tristate timing relative to notCS 
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Figure 6.6 Typical rise/fall times 



1 Skew is measured between notCS with a standard load (2 Schottky TTL inputs and SOpF) and 
notes with a load of 2 Schottky TTL inputs and varying capacitance. 
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6.4 Power rating 

Internal power dissipation P/jvr of transputer and peripheral chips depends on VCC, as shown in figure 6.7. 
Pint is substantially independent of temperature. 

Total power dissipation Pp of the chip is 

Pd = Pint + Pio 
where Pjo is the power dissipation in the input and output pins; this is application dependent. 
Internal working temperature Tj of the chip is 

Tj^Ta+BJa^Pd 

where Ta is the external ambient temperature in °C and SJa is the junction-to-ambient thermal resistance in 
°C/W. OJa for each package is given in the Packaging Specifications section. 
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Figure 6.7 IMS C012 internal power dissipation vs VCC 
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7 Package specifications 

7.1 24 pin plastic dual-in-line package 
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Figure 7.1 24 pin plastic dual-in-line package dimensions 
Table 7.1 24 pin plastic dual-in-line packagedimensions 



DIM 


Millimetres 


Inches 


Notes 


NOM 


TOL 


NOM 


TOL 


A 


31.242 


+0.508 
-0.254 


1.230 


+0.020 
-0.010 




B 


6.604 


±0.127 


0.260 


±0.005 




C 


3.302 


±0.381 


0.130 


±0.015 




D 


7.620 


±0.127 


0.300 


±0.005 




E 


1.651 


±0.127 


0.065 


±0.005 




F 


2.540 


±0.127 


0.100 


±0.005 




G 


1.524 


±0.127 


0.060 


±0.005 




H 


0.457 


±0.127 


0.018 


±0.005 




J 


8.382 


±0.508 


0.330 


±0.020 




K 


0.254 


±0.025 


0.010 


±0.001 




L 


0.508 


±0.127 


0.020 


±0.005 




M 


3.048 




0.120 




Minimum 



Package weight is approximately 2 grams 

Table 7.2 24 pin plastic dual-in-line package junction to ambient thermal resistance 



SYMBOL 


PARAMETER 


MIN 


NOM 


MAX 


UNITS 


NOTE 


^JA 


At 400 linear ft/min transverse air flow 




115 




°c/w 
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7.2 



Pinout 









VCC 


LinkOut 1 l 


\J 


1 24 


Linkin 2 i 




1 23 


CapMinus 


RnotW 3 c 




] 22 


Inputint 


Outputint 4 [ 




1 21 


notes 


RSO 5 [ 




1 20 


DO 


RSI 6 C 




3 19 


D1 


D3 7 [ 




3 18 


D2 


D5 8 [ 




1 17 


D4 


HoldToGND 9 [ 




1 16 


D6 


D7 10 c 




1 15 


LinkSpeed 


Reset 11 c 




1 14 


HoldToGND 


GND 12 C 




3 13 


Clockln 









Figure 7.2 IMS C01 2 pinout 
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8 Ordering 

This section indicates tiie designation of package selections for the IMS C012. Speed of Clockln is 5 MHz 
for all parts. 

For availability contact local INMOS sales office or authorised distributor. 

Table 8.1 IMS C012 ordering details 



INMOS designation 


Package 


IIVIS C012-P20S 


24 pin plastic dual-in-line 



QjUmoS Appendix A 



# quality and 

reliability 
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A Quality and Reliability 

The INMOS quality programme is set up to be attentive to every phase of the semiconductor product life 
cycle. This includes specific programmes in each of the following areas: 

• Total Quality Control (TQC) 

• Quality and Reliability in Design 

• Document Control 

• New Product Qualification 

• Product Monitoring Programme 

• Production Testing and Quality Monitoring Procedure 

A.1 Total quality control (TQC) and reliability programme 

Our objective to continuously build improved quality and reliability into every INMOS part has resulted in a 
comprehensive Quality/Reliability Programme of which we are proud. This programme demonstrates INMOS' 
serious commitment to supporting the quality and reliability needs of the electronics marketplace. 

INMOS is systematically shifting away from a traditional screening approach to quality control and towards 
one of building in Experimental Design quality through Statistical Process Control (SPC). This new direction 
was initiated with a vigorous programme of education and scientific method training. 

In the first year of the programme approximately 80 INMOS employees worldwide received thorough SPC 
training. This training has been extended to cover advanced SPC and experimental design. Some of the 
courses taught are listed below: 

• Experimental Design Techniques 

• Statistical Process Control Methods 

• Quality Concepts 

• Problem Solving Techniques 

• Statistical Software Analysis Techniques 

Today INMOS utilizes experimental design techniques and process control/monitoring throughout its devel- 
opment and manufacturing cycles. The following TQC tools are currently supported by extensive databases 
and analysis software. 

1 . Pareto charts 6. Correlation Plots 

2. Cause/Effect Diagrams 7. Control Charts 

3. Process Flow Charts 8. Experimental Design 

4. Run Charts 9. Process Capability Studies 

5. Histograms 

A.2 Quality and reliability in design 

The INMOS quality programme begins with the design of new INMOS products. The following procedures 
are examples from the INMOS programme to design quality and reliability into every product. 

Innovative design techniques are employed to achieve product performance using, whenever possible, state 
of the art techniques. For example, INMOS uses 300 nm gate oxides on its high performance graphics, 
SRAM and MICRO products to obtain the reliability inherent in the thicker gate oxide. In addition, circuit 
design engineers work hand in hand with process engineers to optimise the design for the process and the 



A Quality and Reliability 553 



process for the product family. The result is a highly reliable design implemented In a process technology 
achievable within manufacturing. 

INMOS products are designed to have parametric margins beyond the product target specifications. The 
design performance is verified using simulations of circuit performance over voltage and temperature values 
beyond those of specified product operation, including verification beyond the military performance range. In 
addition, the device models are chosen to ensure tolerance to wide variations in process parameters beyond 
those expected in manufacture. 

The design process includes consideration of quality issues such as signal levels available for sensing, 
reduction of internal noise levels, stored data integrity and testability of all device functions. Electro-static 
damage protection techniques are included in the design with input protection goals of 2K volts for MIL-STD- 
883 testing methods. Specific customer requirements can be met by matching their detailed specifications 
against INMOS designed in margins. 

The completion of the design includes the use of INMOS computer aided design software to fully check and 
verify the design and layout. This improves quality as well as ensuring the timely introduction of new products. 

A.3 Document control 

The Document Control Department maintains control over all manufacturing specifications, lot travellers, 
procurement specifications and drawings, reticle tapes and test programmes. New specifications and changes 
are subject to approval by the Engineering and Manufacturing managers or their delegates. Change is 
rigorously controlled through an Engineering Change Notice procedure, and OA department managers screen 
and approve all such changes. 

An extensive archiving system ensures that the history of any Change Notice is readily available. 

Document Control also has responsibility for controlling in-line documentation In all manufacturing areas which 
includes distribution of specifications, control of changes and liaison with production control and manufacturing 
in introducing changed procedures into the line. 

Extensive use is made of computer systems to control documentation on an international basis. 

A.4 New product qualification 

INMOS performs a thorough internal product qualification prior to the delivery of any new product, other than 
engineering samples of prototypes to customers. 

Care is taken to select a representative sample from the final prototype material. This typically consists of 
three different production lots. Testing is then done to assure the initial product reliability levels are achieved. 
Product qualifications are done in accordance with MIL-STD-883, methods 5004 and 5005, or CECC/BS9000. 

The initial INMOS qualification data, and the ongoing monitor data can be very useful in the user qualification 
decision process. INMOS also has a very successful history of performing customer qualification testing 
in-house and performing joint qualification programmes with customers. INMOS remains committed to joint 
customer/vendor programmes. 

A.5 Product monitoring programme 

At the levels of quality and reliability performance required today (low PPM and FIT levels), it is essential that 
a large statistically significant, current product database be maintained. One of the programmes that INMOS 
uses to accomplish this is the Product Monitoring Programme (PMP). 

The PMP is a comprehensive ongoing programme of reliability testing. A small sample is pulled from pro- 
duction lots of a particular part type. This population is then used to create the specific samples to put on 
the various operating and environmental tests. Tests run in this programme include extended temperature 
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operating life, THB and temperature cycle. Efforts are contiriuing to identify and correlate more accelerated 
tests to be used in the PMP. 

A.6 Production testing and quality monitoring procedure 

A.6.1 Reliability testing 

INMOS' primary reliability test method is to bias devices at their maximum rated operating power supply level 
in a 140° C ambient temperature. A scheme of time varying input signals is used to simulate the complete 
functional operation of the device. The failure rate is then computed from the results of the operating life 
test using Arrhenius modelling for each specific failure mechanism known. The failure rate is reported at a 
temperature that is a typical worst case application environment and is expressed in units of FITs where 1 
FIT = 1 Fail in 10E9 device hours, (100 FIT = 0.01 %/1 000 Hrs). The current database enables the failure 
rate to be valid over various environmental conditions. 

The failure rate goal for INMOS products is 100 FITs or less at product introduction with a 50 FIT level to be 
attained within one year. 

For plastic packaged product, additional testing methods and reliability indices become important. Humidity 
testing is used to evaluate the relative hermeticity of the package, and thermal cycling tests are used principally 
to evaluate the durability of the assembly (e.g. die/bond attach). 

The Humidity Test comprises of temperature, humidity, bias (THB) at 85°C, 85% Relative Humidity, and a 
5V static bias configuration selected to maintain the component in a state of minimum power dissipation and 
enhance the formation of galvanic corrosion. INMOS reliability goals have always been to meet or better 
the current 'industry standards' and a target of less than 1% failures through 1000 hours of THB at 90% 
confidence has been set. 

The Thermal Cycling tests are performed from -65°C to + 150 °C for 500-1000 cycles, with no bias applied. 
Thermal Shock tests using a liquid to liquid (Freon) method are cycled between -55°C and + 125 °C. 
The INMOS Reliability qualification and monitoring goal for the above tests is less than 1% failures at 90% 
confidence. 

A.6.2 Production testing 

Electrical testing at INMOS begins while the devices are still In wafer form before being divided into individual 
die. While in this form, two different types of electrical test are performed. 

The Parametric Probe test is to verify that the individual component parameters are within their design limits. 
This is accomplished by testing special components on the wafer. The results of these tests provide feedback 
to our wafer fab manufacturing facilities which allows them to ensure that the components used in the actual 
devices perform within their design limits. This testing is performed on all lots which are processed, and any 
substandard wafers discarded. These components are placed in the scribe streets of the wafer so they are 
destroyed in the dicing operation when they are not of any further use. By placing them there, valuable chip 
real estate is saved, thereby holding down cost while still providing the necessary data. 

The Electrical Probe test performed on all wafers is the test of each individual circuit or chip on every wafer. 
The defective dice are identified so they may be later discarded after the wafer has been separated into 
individual die. This test fully exercises the circuits for all AC and DC datasheet parameters in addition to 
verifying functionality. 

After the dice have been assembled into packages they are again tested in our Final Test operation. In a 
mature product the typical flow is: 

• Preburn-in test 

• Burn-in at 140°C 

• Final test 
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• PDA (Percent Defect Allowed) 

• Device Symbolisation 

• QA Final Acceptance 

The temperature setting used for hot testing is selected so that the junction temperature is the same as it 
would be after thermal stabilisation occurred in the specified environment. This is calculated using the hot 
temperature power dissipation along with the thermal resistance of the package used. All INMOS product is 
electrically tested and burned-in prior to shipment. Historically, the industry has selected burn-in times using 
the MIL Standards as a guide (when the market would support the cost) or on a 'best guess' basis dominated 
by cost considerations. Whereas INMOS invoke a burn-in reduction exercise to ensure the reduced time has 
no reliability impact. 

A.6.3 Quality monitoring procedure 

In the Outgoing Quality Monitoring programme, random samples are pulled from lots, that have been suc- 
cessfully tested to data sheet criteria. Rejected lots are 100% retested and more importantly failures are 
analysed and corrective actions identified to prevent the recurrence of specific problems. 

The extensive series of electrical tests with the associated Burn-in PDA limits and Quality Assurance tests 
ensure we will be able to continue to improve our high quality and reliability standards. 
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INMOS MIL-STD-883C/MIL-I-45208 
MATERIAL PROCUREMENT & PRODUCT FLOW 



Wafer Fab Mat'l Procurement [2] 

Wafer Fab Process [2] 

Wafer Level Elect. Test [2] 

Internal Visual QA Sample [2] 

Assembly Mat'l Procurement 

Assembly Mat'l QA Inspect 

Assembly Process [1] 

Electrical Testing 

Burn-in 

[Solder Dip] [3] 

QCI Group A,B 

QCI Group C,D 

Certificate of Conformance 

[CSI.GSI, PVT][4] 

Notes: 

[1] Anam, Korea or GTE, Taiwan 

[2] Newport Fab. Product: 
All NMOS, CMOS SRAM 
All Transputer 
AIIG17x(CLUT) 

[3] Hot Solder Dip as req'd at 
Colo. Spgs. Subcontractor 

[4] As required by Customer 



INMOS 

NEWPORT 

U.K. 



OFFSHORE 
ASSEMBLY 



1 



[5] 600 mil Package Parts, 
All MICRO &G17X Parts 

[6] 300 mil DIP. LCC & FLAT PACK 
SRAM Parts 



\7 Raw Material Procurement 
r^ Manufacturing Process 
I I QA Gate 



INMOS 

COL. SPGS. 

USA 



[6] 



BranOS Appendix B 

# index 
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Index 



! 13 






link IMS T801 171 


" 19 






link IMS T805 105 


18 






Address 


* 18, 114, 117, 180, 183, 250, 253, 321, 


388, 


bus IMS T??? 423 


443 






bus IMST414 359 


+ 18, 114, 117, 180, 183, 250, 253, 321, 


323, 


bus IMS T425 290 


388, 390, 443, 445 






bus IMS T800 221 


- 18, 114, 180, 250, 321, 323, 388, 390, 


443, 


bus IMS T801 160 


445 






buslMST805 80 


/ 18, 114, 117, 180, 183, 250, 253, 321, 


388, 


byte IMS T222 420 


443 






byte IMST414 355 


/\ 18.114.180,250.321,388.443 






byte IMS T425 286 


:= 13 






byte IMST800 217 


< 18. 114, 117, 180, 183, 250, 253, 321, 


323, 


byte IMS T801 157 


388, 390, 443, 445 






byte IMS T805 77 


« 18,114.180,250,321,388,443 






marklMSM212 465 


<= 18,114,117,180.183,250,253. 


32- 


1 , 323, 


refresh IMST414 373 


388. 390. 443, 445 






refresh IMS T425 305 


<> 18,114,117,180,183,250,253, 


32- 


1 . 323, 


refresh IMS T800 235 


388, 390, 443. 445 






refresh IMS T805 91 


= 18, 114, 117, 180, 183, 250, 253, 321, 


323, 


space IMS T222 400, 403 


388, 390, 443, 445 






space IMS T225 454 


> 18, 114. 117. 180, 183, 250, 253, 321, 


323, 


space IMST414 335,337 


388, 390, 443. 445 






space IMS T425 262, 265 


X 18,114.180,250,321,388,443 






space IMS T800 191,193 


>= 18,114,117,180,183,250,253, 


32- 


1 . 323, 


space IMST801 129. 132 


388, 390, 443, 445 






space IMS T805 49. 52 


» 18,114,180,250,321.388,443 






AFTER 18. 19. 40. 115, 181,251, 322, 389, 


? 13 






444 


\/ 18,114,180,250,321,388,443 






ALT 15, 16, 22, 29, 115, 119, 181, 184, 251 


- 1 8. 1 1 4. 1 80, 250, 321 , 388, 443 






255. 322, 325, 389, 391, 444. 446 
Alternation construction 14, 15, 16, 29, 40 


ABS 117,183,253 






Analyse 


Absolute maximum ratings 






IMST2?? 418,419 


IMSC004 494 






IMST414 353,354 


IMSC011 519 






IMST425 284,285 


IMSC012 543 






IMST800 215,216 


IMST222 437 






IMST801 155.156 


IMST414 382 






IMST805 75.76 


IMST425 315 






AND 18,114,180.250.321.388.443 


IMST800 244 






ANSI-IEEE 754-1985 17 


IMST801 174 






IMST800 190.209 


IMST805 108 






IMST801 128.149 


Access 






IMST805 48,69 


byte-wide IMS coil 504 






Application 


byte-wide IMS C012 530 






bidirectional exchange IMS C004 490 


Acknowledge 






bus systems IMS C004 490 


link 39 






drawing coloured text 43 


link IMS C004 485 






enhanced controller IMS M21 2 470 


link IMS C011 509 






IMST222 426 


linklMSC012 535 






IMST414 365 


link IMS T222 434 






IMST425 296 


linklMST414 379 






IMST800 227 


link IMS T425 312 






IMST805 81 



link IMS T800 241 



link switching IMS C004 490 
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multiple control IMS C004 490 

Winchester controller IMS M212 469, 470 
Architecture 28, 29 

internal 31 

rationale 9 
Arithmetic 

multiple length 116, 182, 252, 322, 389, 444 

operation IMS T222 403 

operation IMST414 337 

operation IMS T425 265 

operation IMS T800 193 

operation IMS T801 132 

operation IMS T805 52 

operator 114, 117, 180, 183, 250, 253, 321, 
388, 443 
Array 17 

assignment 119,1 84, 255, 325, 391 , 446 

byte 19 

of disk controllers IMS M212 471 

of processes 16 

of transputers IMS M212 471 

type 17 

variable 114, 180, 250, 321 , 388, 443 
ASCII 19 

ASHIFTLEFT 116.1 82, 252 
ASHIFTRIGHT 116, 182, 252 
Assignment 13, 29, 114, 117, 180, 183, 250, 
253,321,388,443 

array 119,1 84, 255, 325, 391 . 446 

process 12, 13 

Bandwidth 

memory 43 

memory IMS T222 405 

memory IMS T41 4 339 

memory IMS T425 267 

memory IMS T800 195 

memory IMS T801 134 

memory IMS T805 54 
Barrel shifter 42 
Behaviour 

logical 7,20.22 

physical 7 
Benchmark 

LINPACK 42 

speed 41 

Whetstone 41 
Bit 

counting performance 118, 184, 254, 324 

data 39 

operator 114, 180, 250, 321 , 388, 443 

reversal performance 118, 184. 254. 324 

start 39 

stop 39 
Bit-bit 43 

BITCOUNT 118. 184, 254, 324 
BITREVNBIT 118, 184, 254, 324 
BITREVWORD 1 1 8, 1 84, 254, 324 
Block move 43 

conditional 43 



IMST425 262.270 

IMST800 190, 198 

IMST801 128,137 

IMST805 48,57 

performance 118,1 84. 254, 324 

two-dimensional 43 
BOOL 17 
Boolean 

expression 119, 184, 255, 325, 391, 446 

operator 114, 180, 250, 321 . 388, 443 
BootFromRom 

IMST222 416,418 

IMST414 351,353 

IMST425 282,284 

IMST800 213,215 

IMST801 153,155 

IMST805 73,75 
Bootstrap 24. 25 

address IMST222 418 

address IMST414 353 

address IMS T425 284 

address IMST800 215 

address IMST801 155 

address IMS T805 75 

code IMS T222 420 

code IMST414 355 

code IMS T425 286 

code IMS T800 217 

code IMS T801 157 

code IMS T805 77 

IMS M212 467 

IMS T222 416. 418 

IMST414 351.353.376 

IMS T425 282. 284. 308 

IMST800 213,215.238 

IMS T801 153, 155 

IMST805 73,75.94 

program IMS T222 432 

program IMST801 167 
Bootstrapping 

IMST425 286 

IMST805 77 
Brackets 18 
Break point 

IMST425 286 

IMST805 77 
Buffer 

input IMS coil 517 

input IMS C01 2 541 

link 24 

output IMS coil 517 

output IMS C01 2 541 
Bus 31 

IMS coil 514 

IMSC012 538 
Byte 

access IMS C011 504 

access IMS C01 2 530 

access IMS T222 423,428 

access IMS T801 160 



561 



address IMS T222 420, 423 

address IMS T41 4 355 

address IMS T425 286 

address IMS T800 217 

address IMS T801 157, 160 

address IMS T805 77 
BYTE 17 

C 23 

Capacitive load 9 
CapMinus 

IMSC004 482 

IMSC011 506 

IMSC012 532 

IMST222 415 

IMST414 349 

IMST425 280 

IMST800 211 

IMST801 151 

IMST805 71 
CapPlus 

IMSC004 482 

IMST222 415 

IMST414 349 

IMST425 280 

IMST800 211 

IMS T801 151 

IMST805 71 
CASE 16 
CHAN OF 17 

protocol 114, 180, 250, 321, 388, 443 
Channel 8, 12, 13. 15, 17, 20, 23, 29, 114, 180, 
250,321,388,443 

communication 36, 38 

disl< hardware IMS M21 2 466 

empty 36 

event IMS T222 433 

event IMS T41 4 378 

event IMS T425 310 

event IMS T800 240 

event IMS T801 169 

event IMS T805 103 

external 36 

external IMS T222 408 

external IMST414 342 

external IMS T425 270 

external IMS T800 198 

external IMS T801 137 

external IMS T805 57 

IMST222 407 

IMST414 341 

IMST425 269 

IMST800 197 

IMST801 136 

IMST805 56 

input 29 

internal 36 

internal IMS T222 408 

internal IMST414 342 

internal IMS T425 270 



internal IMS T800 198 
internal IMS T801 137 
internal IMS T805 57 
link 23 

link IMS C004 485 
link IMS coil 509 
linklMSC012 535 
link IMS T222 434 
linklMST414 379 
link IMS T425 312 
link IMS T800 241 
link IMS T801 171 
link IMS T805 105 
memory 23 
Occam 23 

output 29 
process 29 

Characteristics 
AC timing IMS C004 
AC timing IMSC011 
AC timing IMSC012 
AC timing IMS T222 
AC timing IMST414 
AC timing IMS T425 
AC timing IMS T800 
AC timing IMS T801 
AC timing IMS T805 
DC electrical IMS C004 
DC electrical IMSC011 
DC electrical IMSC012 
DC electrical IMS T222 
DC electrical IMST414 
DC electrical IMS T425 
DC electrical IMS T800 
DC electrical IMS T801 
DC electrical IMS T805 

CLIP2D 43, 118, 184,254 

Clock 17,25 
input 24, 25 
input, internal IMS C004 
input, internal IMS C011 
input, internal IMS C012 
input, internal IMS T222 
input, internal IMS T414 
input, internal IMS T425 
input, internal IMS T800 
input, internal IMS T801 
input, internal IMS T805 
internal 24 
link 25 

link IMS C004 485 
link IMS coil 509 
linklMSC012 535 
link IMS T222 434 
linklMST414 379 
link IMS T425 312 
link IMS T800 241 
link IMS T801 171 
link IMS T805 105 
multiple IMS C004 482 



496 

521 

545 

439 

384 

317 

246 

176 

110 

494, 495 
519,520 
543, 544 
437, 438 
382, 383 
315,316 
244, 245 
174, 175 
108, 109 
324 



482 
506 
532 
415 
349 
280 
211 
151 
71 
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multiple IMSC011 506 

multiple IMSC012 532 

multiple IMST222 415 

multiple IMST414 349 

multiple IMS T425 280 

multiple IMST800 211 

multiple IMST801 151 

multiple IMS T805 71 

phase 1 1 

processor 40 

processor IMS T222 408 

processor IMS T41 4 342 

processor IMS T425 270 

processor IMS T800 198 

processor IMS T801 137 

processor IMS T805 57 

stability IMS C004 482 

stability IMS C01 1 506 

stability IMS C01 2 532 

stability IMS T222 415 

stability IMS T41 4 349 

stability IMS T425 280 

stability IMS T800 21 1 

stability IMS T801 151 

stability IMS T805 71 

timer 19 

timer IMS T222 408 

timer IMS T41 4 342 

timer IMS T425 270 

timer IMS T800 198 

timer IMS T801 137 

timer IMS T805 57 

transputer 1 1 
Clockln 

IMSC004 482 

IMS coil 506 

IMSC012 532 

IMST222 415 

IMST414 349 

IMST425 280 

IMST800 211 

IMS T801 151 

IMST805 71 

period IMS T222 422 

period IMST414 357 

period IMS T425 288 

period IMS T800 219 

period IMS T801 162 

period IMS T805 82 

sicew IMS coil 507 
Code 

function/operation IMS T222 409 

function/operation IMS T41 4 343 

function/operation IMS T425 272 

function/operation IMS T800 200 

function/operation IMS T801 139 

function/operation IMS T805 59 
Coding efficiency 

IMST222 405 

IMST414 339 



IMST425 267 

IMST800 195 

IMST801 134 

IMST805 54 
Colour 

display 43 

graphics 34 

text example 43 
Communication 8, 9, 10, 28, 31 , 36 

bandwidth 9 

channel 12,36,38 

construction 12, 15 

contention 9 

external 38 

frequency 1 1 , 24 

IMST222 407 

IMST414 341 

IMST425 269 

IMST800 197 

IMST801 136 

IMST805 56 

interface 10 

internal 36 

language 23 

link 9,39 

parallel IMS C01 1 504 

parallel IMSC012 530 

process IMS T222 407 

process IMS T414 341 

process IMS T425 269 

process IMS T800 197 

process IMS T801 136 

process IMS T805 56 

speed 30 
Comparison operator 114, 117, 180, 183, 250, 

253,321,388.443 
Compatibility 

IMST425 263 

IMST805 49 
Concept 29 
Concurrency 7, 12, 28 

IMST222 406 

IMST414 340 

IMST425 268 

IMST800 196 

IMST801 135 

IMST805 55 

internal 12 

support 34 
Concurrent 

FPU/CPU operation 41 

process 12, 14, 15, 22, 29 

systems 12 
Conditional construction 14, 15 
Configuration 

coding IMST414 371 

coding IMS T425 302 

coding IMS T800 233 

coding IMS T805 101 

memory IMS T222 418 
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memory IMS T41 4 353,357,366 




Conversion 


memory IMS T425 284. 288, 297 




INT, REAL 117, 183, 253 


memory IMS T800 215,219,228 




REAL, INT 117, 183, 253 


memory IMS T801 155 






CPU 31 


memory IMS T805 75, 79, J 


81,96 




concurrent operation 41 


memory, external IMS T414 


366, 368. 


369, 


register 31, 32 


370, 372 






CRC 


memory, external IMS T425 


297, 299, 


300, 


IMS M212 465,467 


301 , 303 






IMST425 262 


memory, external IMS T800 


228, 230, 


231, 


IMST800 191 


232, 234 






IMST801 129 


memory, external IMS T805 


98,99, 100, 102 


IMST805 49 


memory, internal IMS T414 


366 




performance 118,1 84, 254, 324 


memory, internal IMS T425 


297 




CRCBYTE 118,1 84, 254, 324 


memory, internal IMS T800 


228 




CRCWORD 118,1 84, 254, 324 


memory, internal IMS T805 


96 




Cyclic redundancy 


program 20 






IMSM212 465,467 


Connection 






IMST425 262 


link IMS C004 485 






IMST800 191 


link IMS coil 509 






IMST801 129 


linklMSC012 535 






IMST805 49 


link IMS T222 434 






performance 118,1 84. 254, 324 


linklMST414 379 








link IMS T425 312 






DO-7 


link IMS T800 241 






IMS coil 508.514 


link IMS T801 171 






IMSC012 534.538 


link IMS T805 105 






DABS 117.183.253 



Constant 1 15, 181 , 251 . 322. 389. 444 

subscript 114, 180, 250, 321 , 388, 443 

value 33 
Construction 1 4, 29, 1 1 5, 1 81 , 251 , 322, 389, 
444 

alternation 14, 15, 16,29,40 

communication 15 

conditional 14, 15 

parallel 12,14,20,29,35 

parallel IMS T222 407 

parallel IMST414 341 

parallel IMS T425 269 

parallel IMS T800 197 

parallel IMS T801 136 

parallel IMS T805 56 

performance 118, 184. 254, 325, 391 , 446 

repetition 16 

replication 16 

selection 16 

sequential 12, 14, 15, 29 
Context switch 

IMST425 286 

IMST805 77 
Control 

byte IMS T222 418 

byte IMST414 351,353 

byte IMS T425 284 

byte IMS T800 215 

byte IMS T801 155 

byte IMS T805 75 

link IMS C004 484 

logic IMS M21 2 466 



Data 
bit 39 

bus IMS T222 422. 423 
buslMST414 359 
bus IMS T425 290 
bus IMS T800 221 
bus IMST801 159. 160 
buslMST805 80 
link 39 

link IMS C004 485 
link IMS coil 509 
linklMSC012 535 
link IMS T222 434 
linklMST414 379 
link IMS T425 312 
link IMS T800 241 
link IMS T801 171 
link IMS T805 105 
rate 24. 39 
rate IMS T222 400 
rate IMS T225 454 
rate IMST414 335 
rate IMS T425 262 
rate IMS T800 191 
rate IMS T801 129 
rate IMS T805 49 
rate link IMS T222 434 
rate link IMST414 379 
rate link IMS T425 312 
rate link IMS T800 241 
rate link IMS T801 171 
rate link IMS T805 105 
read IMS coil 518 
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read IMS C012 542 

separation IMS M212 465 

serial 39 

structure 33 

structure IMS T222 404 

structure IMST414 338 

structure IMS T425 266 

structure IMS T800 194 

structure IMS T801 133 

structure IMS T805 53 

transfer 15 

value 32 

value IMS T222 404 

value IMST414 338 

value IMS T425 266 

value IMS T800 194 

value IMS T801 133 

value IMS T805 53 

write IMS coil 518 

write IMSC012 542 
Data Present 

IMS coil 514,517,518 

IMSC012 538,541,542 
Declaration 17, 114, 180, 250, 321, 388, 443 
Decoupling 

IMSC004 482 

IMS coil 506 

IMSC012 532 

IMST222 415 

IMST414 349 

IMST425 280 

IMST800 211 

IMST801 151 

IMST805 71 
Delay 

input 19 

timer 19 
Deschedule 35, 37, 38 

IMS T222 406, 407, 408 

IMST414 340,341,342 

IMS T425 268, 269, 270 

IMST800 196, 197, 198 

IMST801 135, 136, 137 

IMST805 55,56,57 

point IMS T222 407,410,418 

point IMS T414 341,344,353 

point IMS T425 269, 273, 284 

point IMS T800 197,201,215 

point IMS T801 136, 140, 155 

point IMS T805 56, 60, 75 
Device 29 
Direct function 33 

IMST222 404 

IMST414 338 

IMST425 266 

IMST800 194 

IMST801 133 

IMST805 53 
Direct memory access 

IMST222 423 



IMST414 358 

IMST425 289 

IMST800 220 

IMST801 159 

IMST805 79 
DisablelntRAM 

IMST222 420 

IMST414 355 

IMST425 286 

IMST800 217 

IMST805 77 
Disk 

command IMS M21 2 467,468 

compression/decompression IMS M212 471 

controller IMS M21 2 464,465 

cylinder IMS M212 467 

drive selection IMS M212 464 

encryption/decryption IMS M212 471 

floppy IMS M21 2 464,467 

format IMS M21 2 467 

head IMS M212 467 

head position IMS M212 464 

interleave IMS M212 468 

management IMS M21 2 471 

parameter IMS M212 467 

portlMSM212 465 

programming interface IMS M212 467 

SA400/450 IMS M212 465, 467 

sector IMS M212 467 

ST506/412 IMS M212 465,467 

Status IMSM212 464 

Winchester IMS M21 2 464,467 
DMA 

at reset IMS T222 431,432 

at reset IMS T41 4 377 

at reset IMS T425 309 

at reset IMS T800 239 

at reset IMS T801 167 

at reset IMS T805 95 

IMS T222 423, 424, 431 

IMST414 358.376 

IMS T425 289, 308 

IMS T800 220, 238 

IMST801 159, 161, 167 

IMST805 79,94 

operation IMS T222 431 

operation IMST414 377 

operation IMS T425 309 

operation IMS T800 239 

operation IMS T801 167 

operation IMS T805 95 
DRAW2D 43, 118, 184, 254, 324 
DSQRT 117, 183,253 

FCC 

~IMSM212 465,467 

Efficiency 34 

Electrical 
AC timing characteristics IMS C004 496 
AC timing characteristics IMS C01 1 521 
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AC timing characteristics IMS C012 545 
AC timing cliaracteristics IMS T222 439 
AC timing cliaracteristics IMS T414 384 
AC timing characteristics IMS T425 31 7 
AC timing characteristics IMS T800 246 
AC timing characteristics IMS T801 176 
AC timing characteristics IMS T805 1 1 
DC characteristics IMS C004 494, 495 
DC characteristics IMS C01 1 51 9, 520 
DC characteristics IMS C0 12 543,544 
DC characteristics IMS T222 437, 438 
DC characteristics IMS T41 4 382,383 
DC characteristics IMS T425 315,316 
DC characteristics IMS T800 244, 245 
DC characteristics IMS T801 1 74, 1 75 
DC characteristics IMS T805 108, 109 
operating conditions IMS C004 494 
operating conditions IMS C01 1 519 
operating conditions IMS C012 543 
operating conditions IMS T222 437 
operating conditions IMS T414 382 
operating conditions IMS T425 315 
operating conditions IMS T800 244 
operating conditions IMS T801 174 
operating conditions IMS T805 108 
specification 24 

EMI 
IMST222 422 
IMST414 357 
IMST425 288 
IMST800 219 
IMST801 159 
IMST805 79 

EnableJOBreak 
IMST425 284 
IMST805 75 

Equivalent circuit 
IMSC004 495 
IMSC011 520 
IMSC012 544 
IMST222 438 
IMST414 383 
IMST425 316 
IMST800 245 
IMST801 175 
IMST805 109 

Erastosthenes 1 1 9, 1 84, 255, 325, 391 , 446 

Error 25 
IMST222 418,419 
IMST414 353,354 
IMST425 284.285 
IMST800 215,216 
IMST805 75,76 

Error 25 
IMST222 419 
IMST414 354 
IMST425 285 
IMST800 216 
IMST801 155,156 
IMST805 76 



power up IMS T222 419 

power up IMS T41 4 354 

power up IMS T425 285 

power up IMS T800 216 

power up IMST801 156 

power up IMS T805 76 
Error 25 

analysis 21 

analysis IMS T222 419 

analysis IMST414 354 

analysis IMS T425 285 

analysis IMS T800 216 

analysis IMS T801 156 

analysis IMS T805 76 

circuit IMS T222 419 

circuit IMS T41 4 354 

circuit IMS T425 285 

circuit IMS T800 216 

circuit IMS T801 156 

circuit IMS T805 76 

correcting code 34 

correcting code IMS M21 2 465,467 

expression check 115, 1 81 , 251 , 322, 389, 
444 

floating point IMS T800 210 

floating point IMS T801 150 

floating point IMS T805 70 

handling 21 

IMSM212 467 

languages 21 

reset IMS T222 419 

reset IMS T41 4 354 

reset IMS T425 285 

reset IMS T800 216 

reset IMS T801 156 

reset IMS T805 76 
Errorin 

IMST425 285 

IMST800 216 

IMST805 76 
ErrorOut 

IMS T801 156 
Evaluation 

expression IMS T222 403, 405 

expression IMS T414 337, 339 

expression IMS T425 265, 267 

expression IMS T800 193, 195 

expression IMS T801 132, 134 

expression IMS T805 52, 54 

stack 31 , 32, 36 

stack IMS T222 403, 407, 408 

stack IMS T414 337, 341 , 342 

stack IMS T425 265, 269, 270 

Stack IMS T800 193,197,198 

stack IMS T801 132, 136,137 

stack IMS T805 52,56,57 
Event 16,26 

IMST222 433 

IMST414 378 

IMST425 310 
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IMST800 240 

IMST801 169 

IMST805 103 
EventAck 

IMST222 433 

IMST414 378 

IMST425 310 

IMST800 240 

IMST801 169 

IMST805 103 
EventReq 

IMST222 419,433 

IMST414 354, 378 

IMST425 285,310 

IMST800 216, 240 

IMST801 156,169 

IMST805 76, 103 
EventWaiting 

IMST425 310 

IMST801 169 

IMST805 103 
Example 

drawing coloured text 43 

instruction set IMS T222 409 

instruction set IMS T414 343 

instruction set IMS T425 272 

instruction set IMS T800 200 

instruction set IMS T801 139 

instruction set IMS T805 59 
Execution 

instruction IMS T222 404 

instruction IMST414 338 

instruction IMS T425 266 

instruction IMS T800 194 

instruction IMS T801 133 

instruction IMS T805 53 
Expression 12, 18, 29, 114, 115. 117, 180, 181, 
183, 250, 251, 253, 321, 322, 388, 389, 
443, 444 

evaluation IMS T222 403, 405 

evaluation IMS T414 337, 339 

evaluation IMS T425 265, 267 

evaluation IMS T800 193, 195 

evaluation IMS T801 132, 134 

evaluation IMS T805 52, 54 

subscript 1 1 4, 1 80, 250, 321 , 388, 443 
External 

memory interface IMS T222 422 

memory interface IMS T414 357 

memory interface IMS T425 288 

memory Interface IMS T800 219 

memory interface IMS T801 159 

memory interface IMS T805 79 

memory performance 118, 184, 254, 324, 
391,446 

registers 19 



Factorial 18 
FALSE 18 

Flash multiplier 



42 



Floating point 28, 40 

address 41 

co-processor 42 

comparison 42 

concurrency IMS T800 209 

concurrency IMS T801 149 

concurrency IMS T805 69 

concurrent operation 41 

datapath 42 

design 40, 42 

division 42 

double length IMS T800 209 

double length IMS T801 149 

double length IMS T805 69 

error IMS T800 210 

error IMS T801 150 

error IMS T805 70 

functions 117, 183, 253 

instruction 34, 40 

microcode 42 

multiplication 41 , 42 

normalise IMS T800 209 

normalise IMS T801 149 

normalise IMS T805 69 

operand 41 

performance 113, 117, 179, 183, 249. 253, 
323, 390, 445 

processor 31 , 40 

processor IMS T800 190, 209 

processor IMS T801 128, 149 

processor IMS T805 48. 69 
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format IMS T41 4 338 
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process IMS T800 196, 197 

process IMS T801 135, 136 

process IMS T805 55, 56 
Literal value 33 

IMST222 404 

IMST414 338 

IMST425 266 

IMST800 194 

IMST801 133 

IMST805 53 
Livermore loop 41 
Load 

capacitive 9 

instruction IMS T222 404 

instruction IMST414 338 

instruction IMS T425 266 

instruction IMS T800 194 

Instruction IMS T801 133 
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early IMS T805 98 




IMST414 349 




latelMST414 368 




IMST425 280 




late IMS T425 299 




IMST800 211 




late IMS T800 230 




IMST801 151 




late IMS T805 98 




IMST805 71 




strobe IMS T222 424 




VLSI 29,45 




Strobe IMST414 363 
strobe IMS T425 294 




Wait 




strobe IMS T800 225 




IMST222 429 




strobe IMS T805 80 





IMS T414 357. 362. 369. 374. 375 

IMS T425 288. 293. 300. 306, 307 

IMS T800 219, 224, 231 , 236, 237 

IMST801 165 

IMST805 79,89.90,99 

State generator IMS T222 429 

state generator IMS T801 165 

state IMS T222 423 

state IMS T801 159 
Whetstone benchmark 41 
WHILE 16, 29, 115, 181, 251, 322. 389. 444 
Word 

access IMS T222 428 

length 31,32,34 
Workspace 35, 37 

disk IMS M212 467 

IMST222 403,407,418 

IMST414 337,341,351 

IMST425 265,269,284 

IMST800 193, 197,215 

IMST801 132,136,155 

IMST805 52,56,75 

instruction IMS T222 407 

instruction IMST414 341 

instruction IMS T425 269 



The Transputer Databook contains a description of the 

transputer architecture and engineering data for the 
following members of the transputer family: 

IMS T805 32 bit transputer with on-chip 

floating point unit 
IMS T801 32 bit transputer with on-chip 

floating point unit 

IMS T800 32 bit transputer with on-chip 

floating point unit 

IMS T425 32 bit transputer 

IMS T414 32 bit transputer 

IMS T222 16 bit transputer 

IMS T225 16 bit transputer preview 

IMS M212 intelligent disc drive controller preview 

and communication devices 

IMS C011 communications link adaptor 
IMS C012 communications link adaptor 
IMS C004 communications link switch 



